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I. Basis fther p rt 

1 . With regard to the elements of the international application (Replacement sheets which have been furnished to 
the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally filed" 
and are not annexed to this report since they do not contain amendments (Rules 70, 16 and 70.17)): 
Description, pages: 

1 -47 as originally filed 

Claims, No.: 

1 -61 as originally filed 

Sequence listing part of the description, pages: 

1-8, filed with the demand 

2. With regard to the language, all the elements marked above were available or furnished to this Authority In the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

H contained in the international application in written form. 

□ filed together with the intemational application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure In 
the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been fumished. 

4. The amendments have resulted in the cancellation of: \ 

□ the description, pages: ' 

□ the claims, Nos.: 

□ the drawings, sheets: 
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5, □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under Item 1 and annexed to this 
report,) 

6. Additional observations, if necessary: 



V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
<;itations and explanations supporting such statement 

1. Statement 

Novelty (N) Yes: Claims 20, 26, 57 

No: Claims 1 -1 9, 21-25, 27-56, 58-61 

Inventive step (IS) Yes: Claims 

No: Claims 1-61 

Industrial applicability (lA) Yes: Claims 1-61 

No: Claims 



2. Citations and explanations 
see separate sheet 
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1. Cit d docum nts 

D1: US-A-5435730 
D2: EP-A2-0965641 
D3: WO-A2-0009705 

2. Re Item V 

Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive 
step or industrial applicability; citations and explanations supporting such 
statement 

2.1 . Clainn 1 is directed to a method wherein a gene of interest is introduced into a host 
cell, integrated into the chromosome of said cell and amplified. 

D1 discloses recombinant DMA molecules comprising the Streptomyces gal 
operon (the galE, galT, galK genes and their promoters) or parts of it. D1 further 
teaches the use of the gal operon as a selection marker in a host mutant which 
contains a nonfunctional gal operon: a recombinant DNA molecule comprising the 
gal operon and a gene of interest can be transformed into a host cell and 
integrated by homologous recombination. This method allows the expression of a 
gene of interest without the need of an antibiotic selection (col.7, 1.7-51). 
Even though D1 does not mention the use of the gal operon for the amplification 
of the gene of interest, it appears that the disclosed recombinant molecules would 
be suitable for the claimed method. Hence, D1 is prejudicial to the novelty of 
claims 1-10, 15-19, 32-36, 44-50, 52-56 and 59 (Art. 33(2) PCT). 

2.2. D2 describes how a promoterless foreign gene can be inserted into the lac operon 
of S. thermophilus, transformed into S. thermophilus, integrated into the host 
genome and expressed. The erythromycin resistance gene used to select 
integration events is released by a second recombination event, resulting in a 
perfect replacement of the lac operon and a stable expression of the gene! of 
interest (example 1 and claims). \ 
Thus, D2 is novelty destroying to the subject-matter of claims 2, 4, 16-19, 21-25, 
27-32, 38-46, 55, 56, 58-61 (Art. 33(2) PCT). ' 

2.3. D3, cited in the application, teaches the use of galE for selection of transformants 
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on a medium containing galactose (p.43, 1.25- p.46, 1.28) and gives several 
examples of transformed plants which have stably integrated the gene of interest. 
Hence, in D3 the same transformation method as in the application is used which 
has the advantage of circumventing the need for antibiotic selection. 
Thus, D3 is novelty destroying to the subject-matter of claims 2, 5-15, 17, 24, 25, 
30-37, 41, 44, 45, 47-54, 59 and 60 (Art. 33(2) PCT). 

2.4. The remaining dependent claims, if novel, do not appear to recite any technical 
features that would Justify an inventive step (Art. 33(3) PCT). 

2.5. The subject-matter of claims 1-61 is industrially applicable in the field of 
microbiology (Art. 33(4) PCT). 

2.6. Hence, the idea of using the gal operon as a selection marker in order to avoid the 
need for antibiotics was known in the art and even though none of the cited prior 
art documents mentions its use for gene amplification, the described systems 
appear to be suitable for gene amplification and thus prejudicial to the novelty of 
the claims as formulated. Indeed, the claimed expression systems do not contain 
any technical features that could differentiate them from the prior art. 
Moreover, the claims do not meet the requirements of Article 6 PCT in that the 
matter for which protection is sought is not clearly defined: the description only 
gives the example of the galE operon, whereas the claims vaguely claim any 
"amplification unit". This vague term does not enable the skilled person to 
determine which technical features are necessary to perform the stated function, 
i.e. the amplification of the gene of interest. 

2.7. Although claims 1 and 3 have been drafted as separate independent claims, they 
appear to relate effectively to the same subject-matter and to differ from each 
other only with regard to the definition of the subject-matter for which protection is 
sought in respect of the terminology used for the features of that subject-matter. 
The aforementioned claims therefore lack conciseness. Moreover, lack of clarity of 
the claims as a whole arises, since the plurality of independent claims makes it 
difficult, if not impossible, to determine the matter for which protection is sought, 
and places an undue burden on others seeking to establish the extent of the 
protection. 
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Hence, the claims do not meet the requirements of Article 6 PCX. 

In order to overcome this objection, it would appear appropriate to file an 
amended set of claims defining the relevant subject-matter in terms of a minimum 
number of independent claims in each category followed by dependent claims 
covering features which are merely optional (Rule 6.4 PCT). 

2.8. Contrary to the requirements of Rule 5.1 (a)(ii) PCT, the relevant background art 
disclosed in the documents D1 and D2 is not mentioned in the description, nor are 
these documents identified therein. 



1 
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A sensor membrane comprising at ieast one polymer material, af least one 
surfactant and at least one hydropliilic compound, which is not a polymer, in 
admixture. 

A method for the preparation of a sensor membrane comprising the steps of: 

mixing at least one polymer material, at least one surfactant, at least on 
hydrophilic compoundrwhich is not a polymer, and at least one solvent 

applying said mixture onto a substrate 

solvent evaporation, and 

optionally membrane conditioning 
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O (54) Title: METHOD FOR INCREASING GENE COPY NUMBER IN A HOST CELL AND RESULTING HOST CELL 

2 (57) Abstract: The invention relates to a method for increasing the copy number of a chromosomally integrated expression cassette 
in a microbial strain without leaving antibiotic resistance markers behind in the strain, the necessary genetic constructs, and the 
O strains resulting from the method of the invention. In the method an expression cassette comprising a gene of interest and a copy 
^ of a gene being non-functional in the chromosome of the host cell is introduced in the host cell. The host cell is cultivated in the 
1^ presence of a precursor to an inhibiting compound produced if the expression cassette is not integrated into the chromosome. 
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Method for Increasing Gene Copy Nmiiber in a host cell and 
resulting host cell 

Field of the Invention 

The invention relates to a method for increasing the copy 
number of a chromosomally integrated expression cassette in a 
microbial strain without leaving antibiotic resistance markers 
behind in the strain, the necessary genetic constructs, and the 
strains used in and resulting from the method of the invention. 
It is desirable for the biotech industry to provide microbial 
strains devoid of antibiotic resistance markers comprising 
several chromosomally integrated copies of a gene of interest, 
for the industrial high yield production of polypeptides. 

Background of the Invention 

The present debate concerning the industrial use of 
recombinant DNA technology has raised some questions and 
concern about the use of antibiotic marker genes. An antibiotic 
marker gene is traditionally used as a means to select for 
strains carrying multiple copies of both the marker gene and an 
accompanying expression cassette coding for a polypeptide of 
industrial interest. Amplification of the expression cassette 
by increasing the copy number in a microbiological production 
strain is desirable because there is very often a direct 
correlation between the number of copies and the final product 
yields. The amplification method using antibiotic selection has 
been used extensively in many host strains over the past 15 
years and has proven to be a very efficient way to develop high 
yielding production strains in a relatively short time, 
irrespective of. the expression level of the individual 
expression cassettes. 

In order to comply with the current demand for recombinant 
production host strains devoid of antibiotic markers, we have 
looked for possible alternatives to the present technology that 
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will allow substitution of the anti h-ir^i-^ 

»lth new ^rker genes. -'^bxot.= „«rfcers „a use tc^y 

The ctabolic pathway of galactose in bacilli is verv 
s..Uar to the pathway of other auga«. The carbon .olecule ll 

molecule „.th a phosphate group and a transferase relct.^n 
transfers the phosphate group to a glucose molecule Teh "s 
then shuttled directly into the glycolytic pathway, m the ca e 
Of ga aotose catabolis™ the transferase reaction generates 
IT, as a sideproduct which is a very toxic co^ound for 
all Uv.ng cells. This compound is nor^Uy converted to ODP 
glucose by an epi.erase coded for by the galK gene, ^e use of 
galE rn a sro^le selection .ethod for plas.id transfor,^d 
cells, especially plant cells, is mentioned in „o 00/09705. 

Summary of the Invention 

The problem to be solved by the present invention is to 
-crease the copy nu^er of a chromoso^lly integrated 
egression cassette in a microbial strain in a way by which a 
resultrng host cell devoid of antibiotic ^rlcers is provided 
fcr^ the use xn industrial production of polypeptides in high 

The solution is based on that the present inventors 
de^nstrated that a nucleotide construct con^rising an 
aoplafrcat.on unit as defined herein can integrate into the 
chromosome of a host cell and increase in number of 
chromosomally integrated copies without the use of classical 
antibiotic markers or antibiotics. 

Accordingly, in a first aspect the invention relates to a 
method for increasing the number of copies of an amplification 
unit integrated into a host cell chromosome, wherein the method 
comprises the steps of: 

a) rendering a chromosomal gene of a host cell non- 
functional, wherein the host cell becomes susceptible to an 
:Lnhibitory compound endogenously produced by the host cell 
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when the host cell is cultivated in a medium comprising a 
precursor; 

b) making a nucleic acid construct comprising an 
amplification unit, wherein the unit comprises: 
5 i) an expression cassette comprising at least one copy of 

a gene of interest; and 
ii) an expressable copy of the chromosomal gene of step 
a) , wherein the unit integrates into the host cell 
chromosome ; 

10 c) introducing the nucleic acid construct of step b) into the 
host cell of step a) , wherein at least one copy of the 
amplification unit integrates into the host cell 
chromosome; 

d) cultivating the host cell of step c) in a medium 
15 comprising the precursor, wherein a chromosomally 

integrated copy of the amplification unit is duplicated or 
multiplied on the host cell chromosome; 

e) selecting a host cell comprising two or more chromosomally 
integrated copies of the amplification unit; and optionally 

20 f ) performing one or more cycles of steps d) and e) using the 
host cell selected in step e) in each new cycle; wherein 
the number of chromosomally integrated copies of the 
amplification unit increases with each repeat. 
Further, in a second aspect the invention relates to a 
25 method for constructing a host cell comprising at least one 
copy of an amplification unit integrated into the host cell 
chromosome, wherein the method comprises the steps of: 

a) rendering a chromosomal gene of a host cell non- 
functional, wherein the host cell becomes susceptible to an 

30 inhibitory compound endogenously produced by the host cell 

when the host cell is cultivated in a medium comprising a 
precursor; 

b) making a nucleic acid construct comprising an 
amplification unit, wherein the unit comprises: 

35 i) an expression cassette comprising at least one copy of 

a gene of interest; and 
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ii) an expressable copy of the chromosomal gene of step 
a) , wherein the unit integrates into the host cell 
chromosome ; 

c) introducing the nucleic acid construct of step b) into the 
host cell of step a) and cultivating the host cell in a 
medium, comprising the precursor, wherein at least one copy 
of the amplification unit integrates into the host cell 
chromosome ; and 

d) selecting a host cell comprising at least one 
chromosomally integrated copy of the att^lif ication unit. 

A third aspect of the invention relates to a method for 
increasing the number of copies of an amplification unit 
integrated into a host cell chromosome, wherein the method 
comprises the steps of: 

a) providing a host cell, wherein a chromosomal gene has been 
rendered non- functional, whereby the host cell becomes 
susceptible to an inhibitory compound endogenously produced 
by the host cell when the host cell is cultivated in a 
medium comprising a precursor; 

b) introducing a nucleic acid construct into the host 'cell of 
step a) , the nucleic acid construct comprising an 
amplification unit, wherein the unit comprises: 

i) an expression cassette comprising at least one copy of 
a gene of interest; and 

ii) an expressable copy of the chromosomal gene of step 

a), 

wherein at least one copy of the amplification unit 
integrates into the host cell chromosome; 

c) cultivating the host cell of step b) in a medium 
comprising the precursor, wherein a chromosomally 
integrated copy of the amplification unit is duplicated or 
multiplied on the host cell chromosome; 

d) selecting a host cell comprising two or more chromosomally 
integrated copies of the amplification unit; and optionally 

e) performing one or more cycles of steps c) and d) using the 
host cell selected in step d) in each new cycle; wherein the 
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number of chromosomal ly integrated copies of the amplification 
unit increases with each cycle. 

As clear from above, genetic tools are provided for 
performing the method of the invention as described herein. 
5 Accordingly in a fourth aspect the invention relates to an 

amplification unit comprising: 

a) an expression cassette comprising at least one copy of a 
gene of interest; and 

b) an expressable copy of a conditionally essential chromosomal 
10 gene of a host cell; wherein the unit integrates into the 

host cell chromosome upon introduction of the nucleic acid 
construct into the host cell. 

Further in a fifth aspect the invention relates to a 
nucleic acid construct comprising a unit as defined in any of 
15 the previous aspects. 

The method of the invention achieves the construction of a 
host cell comprising at least one chromosomally integrated copy 
of the amplification unit as defined above, where such a host 
cell is" highly desirable for industrial production of 
20 polypeptides in high yields . 

Consequently in a sixth aspect the invention relates to a 
host cell wherein a chromosomal gene has been rendered non- 
functional leaving the host cell susceptible to an inhibitory 
compoxind endogenously produced by the host cell when cultivated 
25 in a medium comprising a precursor; and wherein the host cell 
comprises an amplification unit as defined in any of the 
previous aspects or a nucleotide construct as defined in the 
previous aspect. 

In a final aspect the invention relates to a process for 
30 producing a polypeptide of interest, wherein the process 
comprises a step of cultivating a host cell as defined in the 
previous aspect . 

Drawings 

35 Figure 1 : Shows a Southern blot which demonstrated 
hybridization to flanking fragments of the dal locus and a 
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strong hybridization band to the expression cassette 
corresponding to the size of the plasmid pMOL1807 (SEQ ID 2) in 
a non-limiting example herein. 

5 Definitions 

In accordance with the present invention there may be 
employed conventional molecular biology, microbiology, and 
recombinant DNA techniques within the skill of the art. Such 
techniques are explained fully in the literature. See, e g 
) Sambrook, Fritsch . Maniatis, Molecular Cloning: A Laborator; 
Manual, Second Edition (1989) Cold Spring Harbor Laboratory 
Press, cold Spring Harbor, New York (herein "Sambrook et al 
1989") DNA Cloning: A Practical Approach, Volumes I and 11 
/D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait 
ed. 1984); Nucleic Acid Hybridization (B.D. Hames & sj 
Higgins eds (1985)); Transcription And Translation (B.D. Hames 
& S.J. Higgins, eds. (1984)); Animal Cell Culture (R i 
Freshney, ed. (1986)); Immobilized Cells And Enzymes (irl 
Press, (1986)); B. Perbal, a Practical Guide To Molecular 
Cloning (1984) . 

A ^polynucleotide" is a single- or double -stranded polymer 
of deoxyribonucleotide or ribonucleotide bases read from the 5' 
to the 3' end. Polynucleotides include RNA and DNA, and may be 
isolated from natural sources, synthesized in vitro, or 
prepared from a combination of natural and synthetic molecules. 

A "nucleic acid molecule" or "nucleotide sequence" refers 
to the phosphate ester polymeric form of ribonucleosides 
(adenosine, guanosine, uridine or cytidine; «RNA molecules") or 
deoxyribonucleosides (deoxyadenosine, deoxyguanosine, 

deoxythymidine, or deoxycytidine; «DNA molecules") in either 
single stranded form, or a double-stranded helix. Double 
stranded DNA-DNA, DNA-RNA and RNA-RNA helices are possible. The 
term nucleic acid molecule, and in particular DNA or RNA 
molecule, refers only to the primary and secondary structure of 
the molecule, and does not limit it to any particular tertiary 
or quaternary forms. Thus, this term includes double- stranded 
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DNA found, inter alia, in linear or circular DNA molecules 
(e.g., restriction fragments), plasmids, and chromosomes. In 
discussing the structure of particular double -stranded DNA 
molecules, sequences may be described herein according to the 
5 normal convention of giving only the sequence in the 5' to 3' 
direction along the nontranscribed strand of DNA (i.e., the 
strand having a sequence homologous to the mRNA) . A 
"recombinant DNA molecule" is a DNA molecule that has undergone 
a molecular biological manipulation. 

10 A nucleic acid molecule is "hybridizable" to another 

nucleic acid molecule, such as a cDNA, genomic DNA, or RNA, 
when a single stranded form of the nucleic acid molecule can 
anneal to the other nucleic acid molecule under the appropriate 
conditions of temperature and solution ionic strength (see 

15 Sambrook et al . , supra). The conditions of temperature and 
ionic strength determine the "stringency" of the hybridization, 
A DNA "coding sequence" or an "open reading frame (ORF) " 
is a double- stranded DNA sequence which is transcribed and 
translated into a polypeptide in a cell in vitro or in vivo 

20 when placed under the control of appropriate regulatory 
sequences. The boundaries of the coding sequence are determined 
by a start codon at the 5' (amino) terminus and a translation 
stop codon at the 3' (carboxyl) terminus. A coding sequence can 
include, but is not limited to, prokaryotic sequences, cDNA 

25 from eukaryotic mRNA, genomic DNA sequences from eukaryotic 
(e.g., mammalian) DNA, and even synthetic DNA sequences. If the 
coding sequence is intended for expression in a eukaryotic 
cell, a polyadenylation signal and transcription termination 
sequence will usually be located 3' to the coding sequence. 

30 An expression vector is a DNA molecule, linear or 

circular, that comprises a segment encoding a polypeptide of 
interest operably linked to additional segments that provide 
for its transcription. Such additional segments may include 
promoter and terminator sequences, and optionally one or more 

35 origins of replication, one or more selectable markers, an 
enhancer, a polyadenylation signal, and the like. Expression 
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"'Operably linked", when referring to DNA segments, 
indicates that the segments are arranged so that they function 
in concert for their intended purposes, e.g. transcription 
initiates in the promoter and proceeds through the coding 
5 segment to the terminator* 

A coding sequence is ''under the control" of 
transcriptional and translational control sequences in a cell 
when RNA polymerase transcribes the coding sequence into mRNA, 
which is then trans -RNA spliced and translated into the protein 
10 encoded by the coding sequence, 

"Heterologous" DNA refers to DNA not naturally located in 
the cell, or in a chromosomal site of the cell. Preferably, the 
heterologous DNA includes a gene foreign to the cell. 

As used herein the term "nucleic acid construct" is 
15 intended to indicate any nucleic acid molecule of cDNA, genomic 
DNA, synthetic DNA or RNA origin. The term "construct" is 
intended to indicate a nucleic acid segment which may be 
single- or double -stranded, and which may be based on a 
complete or partial naturally occurring nucleotide sequence 
20 encoding a polypeptide of interest. The construct may 
optionally contain other nucleic acid segments. 

The nucleic acid construct of the invention encoding the 
polypeptide of the invention may suitably be of genomic or cDNA 
origin, for instance obtained by preparing a genomic or cDNA 
25 library and screening for DNA sequences coding for all or part 
of the polypeptide by hybridization using synthetic 
oligonucleotide probes in accordance with standard techniques 
(cf. Sambrook et al., supra). 

The nucleic acid construct of the invention encoding the 
30 polypeptide may also be prepared synthetically by established 
standard methods, e.g. the phosphoamidite method described by 
Beaucage and Caruthers, Tetrahedron Letters 22 (1981), 1859 - 
1869, or the method described by Matthes et al., EMBO Journal 3 
(1984), 801 - 805. According to the phosphoamidite method, 
35 oligonucleotides are synthesized, e.g. in an automatic DNA 
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synthesizer, purified, annealed, ligated and cloned in suitable 
vectors. 

Furthermore, the nucleic acid construct may be of mixed 
synthetic and genomic, mixed synthetic and cDNA or mixed 
5 genomic and cDNA origin prepared by ligating fragments of 
synthetic, genomic or cDNA origin (as appropriate) , the 
fragments corresponding to various parts of the entire nucleic 
acid construct, in accordance with standard techniques. The 
nucleic acid construct may also be prepared by polymerase chain 

10 reaction using specific primers, for instance as described in 
US 4,683,202 or Saiki et al., Science 239 (1988), 487 - 491. 

The term nucleic acid construct may be synonymous with the 
term "expression cassette" when the nucleic acid construct 
contains the control sequences necessary for expression of a 

15 coding sequence of the present invention 

The term "control sequences" is defined herein to include 
all components which are necessary or advantageous for 
expression of the coding sequence of the nucleic acid sequence. 
Each control sequence may be native or foreign to the nucleic 

20 acid sequence encoding the polypeptide. Such control sequences 
include, but are not limited to, a leader, a polyadenylation 
sequence, a propeptide sequence, a promoter, a signal sequence, 
and a transcription terminator. At a minimum, the control 
sequences include a promoter, and transcriptional and 

25 translational stop signals. The control sequences may be 
provided with linkers for the purpose of introducing specific 
restriction sites facilitating ligation of the control 
sequences with the coding region of the nucleic acid sequence 
encoding a polypeptide. 

30 The control sequence may be an appropriate promoter sequence, a 
nucleic acid sequence which is recognized by a host cell for 
expression of the nucleic acid sequence. The promoter sequence 
contains transcription and translation control sequences which 
mediate the expression of the polypeptide. The promoter may be 

35 any nucleic acid sequence which shows transcriptional activity 
in the host cell of choice and may be obtained from genes 
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the gene for the alpha-factor from Saccharomyces cerevisiae, an 
amylase or a protease gene from a Bacillus species, or the calf 
preprochymosin gene. However, any signal peptide coding region 
capable of directing the expressed polypeptide into the 
5 secretory pathway of a host cell of choice may be used in the 
pr e s ent invent i on , 

The control sequence may also be a propeptide coding 
region, which codes for an amino acid sequence positioned at 
the amino terminus of a polypeptide. The resultant polypeptide 

10 is known as a proenzyme or propolypeptide (or a zymogen in some 
cases) . A propolypeptide is generally inactive and can be 
converted to mature active polypeptide by catalytic or 
autocatalytic cleavage of the propeptide from the 
propolypeptide. The propeptide coding region may be obtained 

15 from the Bacillus subtilis alkaline protease gene (aprE) , the 
Bacillus subtilis neutral protease gene (nprT) / the 
Saccharomyces cerevisiae alpha- factor gene, or the 
Myceliophthora thermophilum laccase gene (WO 95/33836) . 

It may also be desirable to add regulatory sequences which 

20 allow the regulation of the expression of the polypeptide 
relative to the growth of the host cell. Examples of 
regulatory systems are those which cause the expression of the 
gene to be turned on or off in response to a chemical or 
physical stimulus, including the presence of a regulatory 

25 compound. Regulatory systems in prokaryotic systems would 
include the lac, tac, and trp operator systems. In yeast, the 
ADH2 system or GALl system may be used. In filamentous fiingi, 
the TAKA alpha-amylase promoter, Aspergillus niger glucoamylase 
promoter, and the Aspergillus oryzae glucoamylase promoter may 

30 be used as regulatory sequences. Other examples of regulatory 
sequences are those which allow for gene amplification. In 
eukaryotic systems, these include the dihydrofolate reductase 
gene which is amplified in the presence of methotrexate, and 
the metallothionein genes which are amplified with heavy 

35 metals. In these cases, the nucleic acid sequence encoding the 
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polypeptide would be placed in tandem with the regulatory 
sequence . 

Examples of suitable promoters for directing the 
transcription of the nucleic acid constructs of the present 
invention, especially in a bacterial host cell, are the 
promoters obtained from the E. coli lac operon, the 
Streptomyces coelicolor agarase gene (dagA) , the Bacillus 
subtilis levansucrase gene (sacB) , the Bacillus subtilis 
alkaline protease gene, the Bacillus licheniformis alpha- 
amylase gene (amyL) , the Bacillus stearothermophilus maltogenic 
amylase gene (amyM) , the Bacillus amyloliquef aciens alpha- 
amylase gene (amyQ) , the Bacillus amyloliquef aciens BAN AMYLASE 
GENE, the Bacillus licheniformis penicillinase gene (penP) , the 
Bacillus subtilis xylA and xylB genes, and the prokaryotic 
beta- lactamase gene (Villa-Kamaroff et al., 1978, Proceedings 
of the National Academy of Sciences USA 75:3727-3731), as well 
as the tac promoter (DeBoer et al., 1983, Proceedings of the 
National Academy of Sciences USA 80:21-25). Further promoters 
are described in "Useful proteins from recombinant bacteria" in 
Scientific American, 1980, 242:74-94; and in Sambrook et al . , 
1989, supra. 

Examples of suitable promoters for directing the 
transcription of the nucleic acid constructs of the present 
invention in a filamentous fungal host cell are promoters 
obtained from the genes encoding Aspergillus oryzae TAKA 
amylase, Rhizomucor miehei aspartic proteinase, Aspergillus 
niger neutral alpha-amylase, Aspergillus niger acid stable 
alpha -amylase, Aspergillus niger or Aspergillus awamori 
glucoamylase (glaA) , Rhizomucor miehei lipase, Aspergillus 
oryzae alkaline protease, Aspergillus oryzae triose phosphate 
isomerase, Aspergillus nidulans acetamidase, Fusarium oxysporum 
trypsin-like protease (as described in U.S. Patent No. 
4,288,627, which is incorporated herein by reference), and 
hybrids thereof. Particularly preferred promoters for use in 
filamentous fungal host cells are the TAKA amylase, NA2-tpi (a 
hybrid of the promoters from the genes encoding Aspergillus 
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niger neutral a-amylase and Aspergillus oryzae triose phosphate 
isomerase) , and glaA promoters. Further suitable promoters for 
use in filamentous fungus host cells are the ADH3 promoter 
(McKnight et al . , The EMBO J. 4 (1985), 2093 - 2099) or the 
5 tpiA promoter. 

Preferred terminators for filamentous fimgal host cells are 
obtained from the genes encoding Aspergillus oryzae TAKA 
amylase, Aspergillus niger glucoamylase, Aspergillus nidulans 
anthranilate synthase, Aspergillus niger alpha-glucosidase, and 

10 Fusarium oxysporum trypsin-like protease, for fungal hosts) the 
TPIl (Alber and Kawasaki, op. cit.) or ADH3 (McKnight et al., 
op . cit . ) terminators . 

Preferred terminators for yeast host cells are obtained 
from the genes encoding Saccharomyces cerevisiae enolase, 

15 Saccharomyces cerevisiae cytochrome C (CYCl) , or Saccharomyces 
cerevisiae glyceraldehyde- 3 -phosphate dehydrogenase . Other 
useful terminators for yeast host cells are described by 
Romanes et al . , 1992, supra. 

An effective signal peptide coding region for bacterial 

20 host cells is the signal peptide coding region obtained from 
the maltogenic amylase gene from Bacillus NCIB 11837, the 
Bacillus stearothermophilus alpha-amylase gene, the Bacillus 
licheniformis subtilisin gene, the Bacillus lichenif ormis beta- 
lactamase gene, the Bacillus stearothermophilus neutral 

25 proteases genes (nprT, nprS, nprM) , and the Bacillus subtilis 
PrsA gene. Further signal peptides are described by Simonen 
and Palva, 1993, Microbiological Reviews 57:109-137. 

An effective signal peptide coding region for filamentous 
fungal host cells is the signal peptide coding region obtained 

30 from Aspergillus oryzae TAKA amylase gene, Aspergillus niger 
neutral amylase gene, the Rhizomucor miehei aspartic proteinase 
gene, the Humicola lanuginosa cellulase or lipase gene, or the 
Rhizomucor miehei lipase or protease gene, Aspergillus sp. 
amylase or glucoamylase, a gene encoding a Rhizomucor miehei 

35 lipase or protease. The signal peptide is preferably derived 
from a gene encoding A. oryzae TAKA amylase, A. niger neutral 
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a-amylase, A. niger acid-stable amylase, or A. niger 
glucoamylase , 

The present invention also relates to recombinant expression 
vectors comprising a nucleic acid sequence of the present 
5 invention, a promoter, and transcriptional and translational 
stop signals. The various nucleic acid and control sequences 
described above may be joined together to produce a recombinant 
expression vector which may include one or more convenient 
restriction sites to allow for insertion or siibstitution of the 

10 nucleic acid sequence encoding the polypeptide at such sites. 
Alternatively, the nucleic acid sequence of the present 
invention may be expressed by inserting the nucleic acid 
sequence or a nucleic acid construct comprising the sequence 
into an appropriate vector for expression. In creating the 

15 expression vector, the coding sequence is located in the vector 
so that the coding sequence is operably linked with the 
appropriate control sequences for expression, and possibly 
secretion. 

The recombinant expression vector may be any vector (e.g., 

20 a plasmid or virus) which can be conveniently sxibjected to 
recombinant DNA procedures and can bring about the expression 
of the nucleic acid sequence. The choice of the vector will 
typically depend on the compatibility of the vector with the 
host cell into which the vector is to be introduced. The 

25 vectors may be linear or closed circular plasmids. The vector 
may be an autonomously replicating vector, i.e., a vector which 
exists as an extrachromosomal entity, the replication of which 
is independent of * chromosomal replication, e.g., a plasmid, an 
extrachromosomal element, a minichromosome, or an artificial 

30 chromosome. The vector may contain any means for assuring 
self -replication. Alternatively, the vector may be one which, 
when introduced into the host cell, is integrated into the 
genome and replicated together with the chromosome (s) into 
which it- has been integrated. The vector system may be a 

35 single vector or plasmid or two or more vectors or plasmids 
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which together contain the total DNA to be introduced into the 
genome of the host cell, or a transposon. 

The vectors of the present invention preferably contain 
one or more selectable markers which permit easy selection of 
5 transformed cells. A selectable marker is a gene the product 
of which provides for biocide or viral resistance, resistance 
to heavy metals, prototrophy to auxotrophs, and the like. 

A conditionally essential gene may function as a 
selectable marker. Examples of bacterial conditionally 

10 essential selectable markers are the dal genes from Bacillus 
subtilis or Bacillus licheniformis, that are only essential 
when the bacterium is cultivated in the presence of D- alanine; 
or the genes encoding enzymes involved in the removal of UDP- 
galactose from the bacterial cell when the cell is grown in the 

15 presence of galactose. Non-limiting examples of such genes are 
those from B. subtilis or B. licheniformis encoding UTP- 
dependent phosphorylase (EC 2.7.7.10), UDP-glucose-dependent 
uridylyltransf erase (EC 2.7.7.12), or UDP-galactose epimerase 
(EC 5,1.3.2) . 

20 Antibiotic selectable markers confer antibiotic resistance 

to such antibiotics as ampicillin, kanamycin, chloramphenicol, 
tetracycline, neomycin, hygromycin or methotrexate. Suitable 
markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, 
TRPl, and URA3 . A selectable marker for use in a filamentous 

25 fungal host cell may be selected from the group including, but 
not limited to, amdS (acetamidase) , argB (ornithine 
carbamoyl transferase) , bar (phosphinothricin 

acetyltransferase) , hygB (hygromycin phosphotransferase) , niaD 
(nitrate reductase), py^G ( orotidine- 5 ' -phosphate 

30 decarboxylase) , sC (sulfate adenyltransf erase) , trpC 
(anthranilate synthase) , and glufosinate resistance markers, as 
well as equivalents from other species. Preferred for use in 
an Aspergillus cell are the amdS and pyrG markers of 
Aspergillus nidulans or Aspergillus oryzae and the bar marker 

35 of Streptomyces hygroscopicus. Furthermore, selection may be 
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accomplished by co-transformation, e.g., as described in WO 
91/17243, where the selectable marker is on a separate vector. 

The vectors of the present invention preferably contain an 
element (s) that permits stable integration of the vector, or of 
a smaller part of the vector, into the host cell genome or 
autonomous replication of the vector in the cell independent of 
the genome of the cell. 

The vectors, or smaller parts of the vectors such as 
amplification units of the present invention, may be integrated 
into the host cell genome when introduced into a host cell. 
For chromosomal integration, the vector may rely on the nucleic 
acid sequence encoding the polypeptide or any other element of 
the vector for stable integration of the vector into the genome 
by homologous or nonhomologous recombination. 

Alternatively, the vector may contain additional nucleic 
acid sequences for directing integration by homologous 
recombination into the genome of the host cell. The additional 
nucleic acid sequences enable the vector to be integrated into 
the host cell genome at a precise location (s) in the 
chromosome (s) . To increase the likelihood of integration at a 
precise location, the integrational elements should preferably 
contain a sufficient number of nucleic acids, such as 100 to 
1,500 base pairs, preferably 400 to 1,500 base pairs, and most 
preferably 800 to 1,500 base pairs, which are highly homologous 
with the corresponding target sequence to enhance the 
probability of homologous recombination. The integrational 
elements may be any sequence that is homologous with the target 
sequence in the genome of the host cell. Furthermore, the 
integrational elements may be non-encoding or encoding nucleic 
acid sequences. 

On the other hand, the vector may be integrated into the 
genome of the host cell by non-homologous recombination. These 
nucleic acid sequences may be any sequence that is homologous 
with a target sequence in the genome of the host cell, and, 
furthermore, may be non-encoding or encoding sequences. The 
copy number of a vector, an expression cassette, an 
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amplification unit, a gene or indeed any defined nucleotide 
sequence is the number of identical copies that are present in 
a host cell at any time. A gene or another defined chromosomal 
nucleotide sequence may be present in one, two, or more copies 
on the chromosome. An autonomously replicating vector may be 
present in one, or several hundred copies per host cell. 

An amplification unit of the invention is a nucleotide 
sequence that can integrate into the chromosome of a host cell, 
whereupon it can increase in number of chromosomally integrated 
copies by duplication of multiplication. The unit comprises an 
expression cassette as defined herein comprising at least one 
copy of a gene of interest and an expressable copy of a 
chromosomal gene, as defined herein, of the host cell. 

For autonomous replication, the vector may further 
comprise an origin of replication enabling the vector to 
replicate autonomously in the host cell in question. Examples 
of bacterial origins of replication are the origins of 
replication of plasmids pBR322, pUC19, pACYCl77, pACYC184, 
pUBllO, pE194, PTA1060, and pAMSl. Examples of origin of 
replications for use in a yeast host cell are the 2 micron 
origin of replication, the combination of CEN6 and ARS4, and 
the combination of CEN3 and ARSl. The origin of replication 
may be one having a mutation which makes its functioning 
temperature-sensitive in the host cell (see, e.g., Ehrlich, 
. 1978, Proceedings of the National Academy of Sciences USA 
75:1433). 

The present invention also relates to recombinant host cells, 

comprising a nucleic acid sequence of the invention, which are 

advantageously used in the recombinant production of the 
0 polypeptides. The term "host cell" encompasses any progeny of 

a parent cell which is not identical to the parent cell due to 

mutations that occur during replication. 

The cell is preferably transformed with a vector 

comprising a nucleic acid sequence of the invention followed by 
5 integration of the vector into the host chromosome. 

"Transformation" means introducing a vector comprising a 
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nucleic acid sequence of the present invention into a host cell 
so that the vector is maintained as a chromosomal integrant or 
as a self -replicating extra- chromosomal vector. Integration is 
generally considered to be an advantage as the nucleic acid 
5 sequence is more likely to be stably maintained in the cell . 
Integration of the vector into the host chromosome may occur by 
. homologous or non-homologous recombination as described above. 

The choice of a host cell will to a large extent depend 
upon the gene encoding the polypeptide and its source. The 

10 host cell may be a unicellular microorganism, e.g., a 
prokaryote, or a non-unicellular microorganism, e.g., a 
eukaryote. Useful unicellular cells are bacterial cells such 
as gram positive bacteria including, but not limited to, a 
Bacillus cell, e.g.. Bacillus alkalophilus. Bacillus 

15 amyloliquef aciens. Bacillus brevis, Bacillus circulans. 
Bacillus coagulans, Bacillus lautus. Bacillus lentus. Bacillus 
licheniformis. Bacillus megaterium. Bacillus 

stearothermophiluB, Bacillus subtilis, and Bacillus 
thuringiensis; or a Streptomyces cell, e.g., Streptomyces 

20 lividans or Streptomyces murinus, or gram negative bacteria 
such as E. coli and Pseudomonas sp. In a preferred embodiment, 
the bacterial host cell is a Bacillus lentus. Bacillus 
licheniformis. Bacillus stearothermophilus or Bacillus subtilis 
cell . 

25 The transformation of a bacterial host cell may, for 

instance, be effected by protoplast transformation (see, e.g., 
Chang and Cohen, 1979, Molecular General Genetics 168:111-115), 
by using competent cells (see, e.g.. Young and Spizizin, 1961, 
Journal of Bacteriology 81:823-829, or Dubnar and Davidoff- 

30 Abelson, 1971, Journal of Molecular Biology 56:209-221), by 
electroporation (see, e.g., Shigekawa and Dower, 1988, 
Biotechniques 6:742-751), or by conjugation (see, e.g., Koehler 
and Thorne, 1987, Journal of Bacteriology 169:5771-5278). 

The host cell may be a fungal cell. ''Fungi" as used 

35 herein includes the phyla Ascomycota, Basidiomycota, 
Chytridiomycota, and Zygomycota (as defined by Hawksworth et 
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al., in, Ainsworth and Bisby's Dictionary of The Fungi, 8th 
edition, 1995, CAB International, University Press, Cambridge, 
UK) as well as the Oomycota (as cited in Hawksworth et al . , 
1995, supra, page 171) and all mitosporic fungi (Hawksworth et 
5 al., 1995, supra). Representative groups of Ascomycota 
include, e.g., Neurospora, Eupenicillium (=Penicillium) , 
Emericella (=Aspergillus) , Eurotium (=Aspergillus) , and the 
true yeasts listed above. Examples of Basidiomycota include 
mushrooms, rusts, and smuts. Representative groups of 

10 Chytridiomycota include, e.g., Allomyces, Blastocladiella, 
Coelomomyces, and aquatic fungi. Representative groups of 
Oomycota include, e.g., Saprolegniomycetous aquatic fungi 
(water molds) such as Achlya. Examples of mitosporic fungi 
include Aspergillus, Penicillium, Candida, and Alternaria. 

15 Representative groups of Zygomycota include, e.g., Rhizopus and 
Mucor . 

The fungal host cell may be a yeast cell. ^^Yeast" as used 
herein ' includes ascosporogenous yeast (Endomycetales) , 
basidiosporogenous yeast, and yeast belonging to the Fungi 

20 Imperfecti (Blastomycetes) . The ascosporogenous yeasts are 
divided into the families Spermophthoraceae and 
Saccharomycetaceae . The latter is comprised of four 

si±>f amilies, Schizosaccharomycoideae (e.g., genus 

Schizosaccharomyces) , Nadsonioideae, Lipomycoideae, and 

25 Saccharomycoideae (e.g., genera Pichia, Kluyveromyces and 
Saccharomyces) . The basidiosporogenous yeasts include the 
genera Leucosporidim, Rhodosporidium, Sporidiobolus, 
Filobasidium, and Filobasidiella. Yeast belonging to the Fungi 
Imperfecti are divided into two families, Sporobolomycetaceae 

30 (e.g., genera Sorobolomyces and Bullera) and Cryptococcaceae 
(e.g., genus Candida). Since the classification of yeast may 
change in the future, for the purposes of this invention, yeast 
shall be defined as described in Biology and Activities of 
Yeast (Skinner, F.A., Passmore, S.M., and Davenport, R.R., eds, 

35. Soc. App. Bacterid. Symposium Series No. 9, 1980. The biology 
of yeast and manipulation of yeast genetics are well known in 
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the art (see, e.g., Biochemistry and Genetics of Yeast, Bacil, 
M./ Horecker, B.J-, and Stopani, A.O.M., editors, 2nd edition, 
1987; The Yeasts, Rose, A.H., and Harrison, J.S., editors, 2nd 
edition, 1987; and The Molecular Biology of the Yeast 
Saccharomyces, Strathern et al., editors, 1981). The yeast host 
cell may be selected from a cell of a species of Candida, 
Kluyveromyces , Saccharomyces , Schizosaccharomyces , Candida, 
Pichia, Hansehula, , or Yarrowia. In a preferred embodiment, 
the yeast host cell is a Saccharomyces carlsbergensis, 
Saccharomyces cerevisiae , Saccharomyces diastaticus , 
Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces 
norbensis or Saccharomyces oviformis cell. Other useful yeast 
host ceMs are a Kluyveromyces lactis Kluyveromyces fragilis 
Hansehula polymorpha, Pichia pastoris Yarrowia lipolytica, 
Schizosaccharomyces pombe, Ustilgo maylis, Candida maltose, 
Pichia guillermondii and Pichia methanolio cell (cf . Gleeson et 
al., J. Gen. Microbiol. 132, 1986, pp. 3459-3465; US 4,882,279 
and US 4,879,231) . 

The fungal host cell may be a filamentous fungal cell. 
"Filamentous fungi" include all filamentous forms of the 
subdivision Eumycota and Oomycota (as defined by Hawksworth et 
al., 1995, supra). The filamentous fungi are characterized by 
a vegetative mycelium composed of chitin, cellulose, glucan, 
chitosan, mannan, and other complex polysaccharides. 
Vegetative growth is by hyphal elongation and carbon catabolism 
is obligately aerobic. In contrast, vegetative growth by 
yeasts such as Saccharomyces cerevisiae is by budding of a 
unicellular thallus and carbon catabolism may be fermentative. 
In a more preferred embodiment, the filamentous fungal host 
cell is a cell of a species of, but not limited to, Acremonium, 
Aspergillus, Fusarium, Humicola, Mucor, Myceliophthora, 
Neurospora, Penicillium, Thielavia, Tolypocladium, and 
Trichoderma or a teleomorph or synonym thereof. In an even 
more preferred embodiment, the filamentous fungal host cell is 
an Aspergillus cell. In another even more preferred 

embodiment, the filamentous fungal host cell is an Acremonium 
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cell. In another even more preferred embodiment, the 
filamentous fiingal host cell is a Fusarium cell. In another 
even more preferred embodiment, the filamentous fungal host 
cell is a Humicola cell. In another even more preferred 
5 embodiment, the filamentous fungal host cell is a Mucor cell. 
In another even more preferred embodiment, the filamentous 
fungal host cell is a Myceliophthora cell. In another even 
more preferred embodiment, the filamentous fungal host cell is 
a Neurospora cell. In another even more preferred embodiment, 

10 the filamentous fungal host cell is a Penicillium cell. In 
another even more preferred embodiment, the filamentous fungal 
host cell is a Thielavia cell. In another even more preferred 
embodiment, the filamentous fungal host cell is a Tolypocladium 
cell. In another even more preferred embodiment, the 

15 filamentous fungal host cell is a Trichoderma cell. In a most 
preferred embodiment, the filamentous fungal host cell is an 
Aspergillus awamori, Aspergillus foetidus, Aspergillus 
japonicus, Aspergillus niger, Aspergillus nidulans or 
Aspergillus oryzae cell. In another most preferred embodiment, 

20 the filamentous fungal host cell is a Ftisarium cell of the 
section Discolor (also known as the section Fusarium) . For 
example, the filamentous fungal parent cell may be a Fusarium 
bactridioides, Fusarium cerealis, Fusarium crookwellense, 
Fusarium culmorum, Fusarium graminearum, Fusarium graminum, 

25 Fusarium heterosporum, Fusarium negundi, Fusarium reticulatum, 
Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, 
Fusarium sulphureum, or Fusarium trichothecioides cell. .In 
another prefered embodiment, the filamentous fungal parent cell 
is a Fusarium strain of the section Elegans, e.g., Fusarium 

30 oxysporum. In another most preferred embodiment, the 

filamentous fungal host cell is a Humicola insolens or Humicola 
lanuginosa cell. In another most preferred embodiment, the 
filamentous fungal host cell is a Mucor miehei cell. In 
another most preferred embodiment, the filamentous fungal host 

35 cell is a Myceliophthora thermophilum cell. In another most 
preferred embodiment, the filamentous fungal host cell is a 
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Neurospora crassa cell. In another most preferred embodiment, 
the filamentous fungal host cell is a Penicillium purpurogenum 
cell. In another most preferred embodiment, the filamentous 
fungal host cell is a Thielavia terrestris cell or a Acremonium 
5 chrysogenum cell. In another most preferred embodiment, the 
Trichoderma cell is a Trichoderma harzianum, Trichoderma 
koningii, Trichoderma longibrachiatum, Trichoderma reesei or 
Trichoderma viride cell. 

The use of Aspergillus spp. for the expression of proteins 

10 is described in, e.g., EP 272 277, EP 230 023. Fungal cells may 
be transformed by a process involving protoplast formation, 
transformation of the protoplasts, and regeneration of the cell 
wall in a manner known per se. Suitable procedures for 
transformation of Aspergillus host cells are described in EP 

15 238 023 and Yelton et al , , 1984, Proceedings of the National 
Academy of Sciences USA 81:1470-1474. A suitable method of 
transforming Fusarium species is described by Malardier et al . , 
1989, Gene 78:147-156 or in copending US Serial No. 08/269,449. 
Examples of other fungal cells are cells of filamentous fungi, 

20 e.g. Aspergillus spp., Neurospora spp., Fusarium spp. or 
Trichoderma spp., in particular strains of A. oryzae, A. 
nidulans or A. niger. The transformation of F. oxysporum may, 
for instance, be carried out as described by Malardier et al., 
1989, Gene 78: 147-156. 

25 Yeast may be transformed using the procedures described by 

Becker and Guarente, In Abelson, J.N. and Simon, M.I., editors. 
Guide to Yeast Genetics and Molecular Biology, Methods in 
Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New 
York; Ito et al., 1983, Journal of Bacteriology 153:163; and 

30 Hinnen et al., 1978, Proceedings of the National Academy of 
Sciences USA 75:1920. Mammalian cells may be transformed by 
direct uptake using the calcium phosphate precipitation method 
of Graham and Van der Eb (1978, Virology 52:546). 

The transformed or transfected host cells described above 

35 are cultured in a suitable nutrient medium under conditions 
permitting the expression of the desired polypeptide, after 
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which the resulting polypeptide is recovered from the cells, or 
the culture broth. 

The medium used to culture the cells may be any conventional 
medium suitable for growing the host cells , such as minimal or 
5 complex media containing appropriate supplements. Suitable 
media are available from commercial suppliers or may be 
prepared according to published recipes (e.g. in catalogues of 
the American Type Culture Collection). The media are prepared 
using procedures known in the art (see, e.g., references for 

10 bacteria and yeast; Bennett, J.W, and LaSure, L., editors, More 
Gene Manipulations in Fungi, Academic Press, CA, 1991) . 

If the polypeptide is secreted into the nutrient medium, 
the polypeptide can be recovered directly from the medium. If 
the polypeptide is not secreted, it is recovered from cell 

15 lysates. The polypeptide are recovered from the culture medium 
by conventional procedures including separating the host cells 
from the medium by centrifugation or filtration, precipitating 
the proteinaceous components of the supernatant or filtrate by 
means of a salt, e.g. ammonium sulphate, purification by a 

20 variety of chromatographic procedures, e.g. ion exchange 
chromatography , ge 1 f i 1 1 rat ion chr oma t ography , af f ini t y 
chromatography, or the like, dependent on the type of 
polypeptide in question. 

The polypeptides may be detected using methods known in 

25 the art that are specific for the polypeptides. These 
detection methods may include use of specific antibodies, 
formation of an enzyme product, or disappearance of an enzyme 
substrate. For example, an enzyme assay may be used to 
determine the activity of the polypeptide. 

30 The polypeptides of the present invention may be purified 

by a variety of procedures known in the art including, but not 
limited to, chromatography (e.g., ion exchange, affinity, 
hydrophobic, chromatof ocusing, and size exclusion), 
electrophoretic procedures (e.g., preparative isoelectric 

35 focusing (lEF) , differential solubility (e.g., ammonium sulfate 
precipitation), or extraction (see, e.g., Protein Purification, 
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J.-C. Janson and Lars Ryden, editors, VCH Publishers, New York, 
1989) . 



Detailed description of the Invention 

A method for increasing the number of copies of an 
amplification unit integrated into a host cell chromosome, 
according to the first, second, or third aspect of the 
invention. 

In the industry there are a number of preferred bacterial 
host cells, especially Gram-positive microorganisms are 
desirable . 

Accordingly in a preferred embodiment the invention 
relates to the method of the first two aspects, wherein the 
host cell is a Gram-positive bacterial cell, preferably a 
Bacillus cell, more preferably a Bacillus cell of a species 
chosen from the group consisting of Bacillus alkalophilus. 
Bacillus amyloliquef aciens. Bacillus brevis. Bacillus 
circulans. Bacillus clausii. Bacillus coagulans. Bacillus 
lautus. Bacillus lentus. Bacillus licheniformis. Bacillus 
megaterium. Bacillus stearothermophilus. Bacillus subtilis, and 
Bacillus thuringiensis; and most preferably a Bacillus 
licheniformis cell , 

A host cell is susceptible to an inhibitory compoimd, if 
the host cell has reduced growth rate in the presence of the 
compound when compared to the growth rate in the absence of the 
compound in a growth medium, or if the host cell becomes non- 
culturable in the presence or the compound, or if the host cell 
is killed in the presence of the compound. Antibiotics fall 
under this definition of inhibitory compounds however not all 
inhibitory compounds are classified as classical antibiotics. 

The inhibitory compound may be endogenous ly produced by 
the host cell as part of the host cell's normal metabolism, 
where the compound is normally not foimd in inhibitory 
concentrations. Rendering a chromosomal gene of the host cell 
non- functional may result in the accumulation of an 
endogenously produced inhibitory compound within the host cell 
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resulting in an inhibitory concentration of the compound. In 
some cases the inhibitory compound is only produced in the host 
cell when the host cell is cultivated in the presence of a 
precursor. In a preferred embodiment of the invention the 
5 inhibitory compound is UDP-galactose . 

Preferable examples of precursors are galactose containing 
compounds - such as lactoses, melibioses, raffinoses, 
stachyoses, verbascoses and galactinola. More preferable 
precursors of galactose include alfa-lactose (beta-D- 

10 galactopyranosyl- [l->4] -alfa-D-glucose) , and other substrates 
which liberates free D-galactose upon hydrolysis by either 
alfa-galactosidases or beta-galactosidases . Other examples of 
potentially useful precursors for use in the method of the 
invention are chemically derivatised forms of galactose, 

15 preferably chemical derivatives of D-galactose, from which D- 
galactose can be liberated by use of appropriate techniques, 
such as enzyme action, where the appropriate enzyme may be 
comprised in the medium or may be added to the medium or may 
indeed be secreted into the medium by the host cell. By way of 

20 example suitable derivatives are D-galactose pentaacetate and 
D-galactose methyl galactoside. Preferably the medium may 
comprise a derivative of galactose, such as galactose-1- 
phosphate or UDP-galactose . 

Accordingly in a preferred embodiment the invention 

25 relates to the method of the first, second or third aspects, 
wherein the chromosomal gene of step a) encodes an enzyme, 
preferably chosen from the group consisting of galactokinase 
(EC 2.7.1.6), UTP-dependent pyrophosphorylase (EC 2.7.7.10), 
UDP-glucose-dependent uridylyltransf erase (EC 2.7.7.12), UDP- 

30 galactose epimerase (EC 5.1-.2.3); more preferably the 
chromosomal gene of step a) encodes an enzyme with UDP- 
galactose epimerase activity (EC 5.1.2.3), and most preferably 
the chromosomal gene of step a) is galE. 

Further in a preferred embodiment the invention relates to 

35 the method of the first, second, or third aspects, wherein the 
inhibitory compound is UDP-galactose. 
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Still further in a preferred embodiment the invention 
relates to the method of the first, second, or third aspects, 
wherein the precursor is free galactose, preferably free D- 
galactose; more preferably the precursor can be degraded to 
produce free galactose, or preferably free D-galactose; even 
more preferably the precursor is lactose, melibiose, raffinose, 
stachyose, verbascose or galactinol. 

Another preferred embodiment of the invention relates to 
the method of the first, second, or third aspects, wherein the 
medium comprises an enzyme capable of degrading the precursor 
to produce free galactose, or preferably free D-galactose. 

One preferred embodiment of the invention relates to the 
method of the first, second, or third aspects, wherein the host 
cell secretes an enzyme into the medium which is capable of 
degrading the precursor to produce free galactose, or 
preferably free D-galactose, preferably the enzyme is a 
galactosidase, preferably an alfa-galactosidase or a beta- 
galactosidase . 

As mentioned above this invention also concerns a nucleic 
acid construct as defined elsewhere herein along with one or 
more components also described elsewhere herein that may be 
comprised in the construct. 

Consequently a preferred embodiment of the invention 
relates to the method of the first, second, or third aspects, 
wherein wherein the nucleic acid construct is a plasmid. 

In a non-limiting example shown herein of the method of 
the invention it is demonstrated how antibiotic selectable 
markers may be comprised in the nucleic acid construct of the 
invention, and also how such markers may eventually be removed 
from the host cell by the help of specific resolvase enzymes, a 
technique which is well known in the art. 

Accordingly a preferred embodiment of the invention 
relates to the method of the first, second, or third aspects, 
wherein the nucleic acid construct further comprises an 
antibiotic selection marker, preferably flanked by by resolvase 
sites or res-sites. 
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As described supra chromosomal integration of a vector or 
a smaller part of a vector - such as an amplification unit as 
defined supra - into the genome of the host cell can be 
achieved by a number of ways. A non-limiting example of 
5 integration by homologous recombination was shown herein. 

A preferred embodiment of the invention relates to the 
method of the first, second, or third aspects, wherein the 
amplification unit further comprises a nucleotide sequence with 
a homology to a chromosomal nucleotide sequence of the host 

10 cell sufficient to effect chromosomal integration in the host 
cell of the amplification unit by homologous recombination, 
preferably the amplification unit further cortqprises a 
nucleotide sequence of at least 100 bp, preferably 200 bp, more 
preferably 300 bp, even more preferably 400 bp, and most 

15 preferably at least 500 bp with an identity of at least 70%, 
preferably 80%, more preferably 90%, even more preferably 95%, 
and most preferably at least 98% identity to a chromosomal 
nucleotide sequence of the host cell , 

In a non- limiting example integration into the chromosome 

20 of a host cell can be selected for by first rendering a 
conditionally essential host cell gene non- functional as 
described elsewhere herein, thereby rendering the host cell 
selectable, then targetting the vector's integration by 
including on this a likewise non- functional copy of same host 

25 gene of a size that allows homologous recombination between the 
two different copies of the non- functional host genes in the 
genome of the host cell and on the integration vector - where 
such a recombination will restore a functional copy of the 
gene, thus leaving the host cell selectable. 

30 Accordingly a preferred embodiment of the invention 

relates to the method of the first, second, or third aspects, 
wherein the nucleotide sequence comprised in the amplification 
unit is a partial non- functional copy of a conditionally 
essential gene of the host cell, wherein the host cell prior to 

35 the first step of the invention has had the conditionally 
essential gene rendered non functional by a partial deletetion, 
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and wherein a recombination event between the partial copy of 
the gene comprised in the amplification imit and the partial 
chromosomal gene restores a functional chromosomal gene; 
preferably the conditionally essential gene encodes a D-alanine 
racemase, preferably the conditionally essential gene is dal. 

Another preferred embodiment of the invention relates to 
the method of the first, second, or third aspects, wherein a 
first amplification unit integrates into the host cell 
chromosome by homologous recombination with the partially 
deleted conditionally essential gene and renders the gene 
functional . 

Yet another preferred embodiment of the invention relates 
to the method of the first, second, or third aspects, wherein 
the amplification unit further comprises an antibiotic marker, 
preferably flanked by resolvase sites or res-sites; preferably 
a host cell comprising a first chromosomally integrated 
amplification unit is selected and the antibiotic marker 
excised from the host cell chromosome by a resolvase prior to 
the next step in the method. 

In the industrial production of polypeptides it is of 
interest to cultivate a host cell comprising several copies of 
a gene encoding a polypeptide of interest to achieve high 
yields. A preferred embodiment of the invention relates to 
the method of the first, second, or third aspects, wherein the 
gene of interest encodes an polypeptide of interest, preferably 
the polypeptide is an enzyme such as a protease; a cellulase; a 
lipase; a xylanase; a phospholipase; or preferably an amylase. 

Another preferred embodiment of the invention relates to 
the method of the first, second, or third aspects, wherein the 
polypeptide is a hormone, a pro-hormone, a pre -pro -hormone, a 
small peptide, a receptor, or a neuropeptide. 

In the present invention the expressably copy of a 
chromosomal gene as defined above is transcribed at a reduced 
level compared to the wild type level of the gene in the host 
cell . 
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One preferred embodiment of the invention relates to the 
method of the first, second, or third aspects, wherein the 
expressable copy of the chromosomal gene comprised in an 
amplification unit integrated in the host cell chromosome has a 
5 reduced transcription level- compared to the transcription level 
of the wild type gene of the host cell, preferably the 
transcription level is reduced with a factor of 100, preferably 
50, more preferably 10, even more preferably 5, and most 
preferably with a factor of 2; preferably the expressable copy 
10 of the chromosomal gene comprised in the amplification unit is 
promoterless, more preferably 

the expressable copy of the chromosomal gene comprised in the 
amplification unit has a transcription terminator located 
upstream of the gene. 

15 In a non- limiting example herein the gene of interest is 

located upstream from the expressable copy of the chromosomal 
gene and the two genes are co- transcribed from the promoter of 
the gene of interest . 

A preferred embodiment of the invention relates to the 

20 method of the first, second, or third aspects, wherein the gene 
of interest is located upstream of the expressable copy of the 
chromosomal gene within the amplification unit and wherein the 
two genes are co-directionally transcribed; preferably the 
expressable copy of the chromosomal gene is expressed by read- 

25 through transcription from the gene of interest. 

The method of the present invention provides a number of 
genetic tools that are advantageous in the invention. An 
amplification unit of the fourth aspect of the invention. 

In a preferred embodiment the invention relates to the 

30 amplification unit of the fourth aspect of the invention 
wherein the chromosomal gene encodes an enzyme, preferably 
chosen from the group consisting of galactokinase (EC 2.7.1.6), 
UTP-dependent pyrophosphorylase (EC 2.7.7.10), UDP-glucose- 
dependent uridylyltransf erase (EC 2.7.7.12), UDP-galactose 

35 epimerase (EC 5.1.2,3); preferably the chromosomal gene encodes 
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an enzyme with UDP-galactose epimerase activity (EC 5.1,2.3); 
more preferably the chromosomal gene is galE. 

In the industrial production of polypeptides it is of 
interest to cultivate a host cell comprising several copies of 
5 a gene encoding a polypeptide of interest to achieve high 
yields . 

Accordingly a preferred embodiment of the invention 
relates to the amplification unit of the fourth aspect of the 
invention wherein the gene of interest encodes an polypeptide 

10 of interest; preferably the polypeptide is an enzyme such as a 
protease; a cellulase; a lipase; a xylanase; a phospholipase; 
or preferably an amylase. 

Another preferred embodiment of . the invention relates to 
the amplification unit of the fourth aspect of the invention 

15 wherein the polypeptide is a hormone, a pro-hormone, a pre-pro- 
hormone, a small peptide, a receptor, or a neuropeptide. 

Yet another preferred embodiment of the invention relates 
to the^ amplification unit of the fourth aspect of the invention 
wherein the expressable copy of the chromosomal gene is 

20 promoterless; - preferably the expressable copy of the 
chromosomal gene has a transcription terminator located 
upstream of the gene; and preferably the gene of interest is 
located upstream of the expressable copy of the chromosomal 
gene and wherein the two genes are co-directionally 

25 transcribed, more preferably the expressable copy of the 
chromosomal gene is expressed by read-through transcription 
from the gene of interest . 

A preferred embodiment of the invention relates to the 
amplification unit of the fourth aspect of the invention which 

30 further comprises an antibiotic marker, preferably flanked by 
resolvase sites or res-sites. 

As mentioned above the method of invention also provides a 
number of genetic tools, a nucleic acid construct comprising a 
unit as defined in any of the previous embodiments of the 

35 fourth aspect , 
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The method of the invention provides a host cell of 
interest for the industry; a host cell wherein a chromosomal 
gene has been rendered non- functional leaving the host cell 
susceptible to an inhibitory compound endogenously produced by 
5 the host cell when cultivated in a medium comprising a 
precursor; and wherein the host cell comprises an amplification 
unit as defined in any of the embodiments of the fourth aspect 
or a nucleotide construct as defined in the fifth aspect. 

Accordingly a preferred embodiment of the invention 

10 relates to the host cell of the sixth aspect, wherein the host 
cell is a Gram-positive bacterial cell, preferably a Bacillus 
cell, more preferably a Bacillus cell of a species chosen from 
the group consisting of Bacillus alkalophilus, Bacillus 
amyloliquef aciens, Bacillus brevis. Bacillus circulans, 

15 Bacillus clausii. Bacillus coagulans, Bacillus lautus, Bacillus 
lentus. Bacillus licheniformis, Bacillus megaterium. Bacillus 
stearothermophilus, Bacillus sxibtilis, and Bacillus 
thuringiensis; and most preferably a Bacillus licheniformis 
cell . 

20 In another preferred embodiment the invention relates to 

the host cell of the sixth aspect, wherein the chromosomal gene 
encodes an enzyme, preferably the enzyme is chosen from the 
group of enzymes consisting of galactokinase (EC 2.7.1.6), UTP- 
dependent pyrophosphorylase (EC 2.7.7.10), UDP-glucose- 

25 dependent uridylyltransf erase (EC 2.7.7.12), UDP-galactose 
epimerase (EC 5.1.2,3), more preferably the enzyme is an UDP- 
galactose epimerase (EC 5.1.2.3), and most preferably the 
enzyme is encoded by galE. 

In yet another preferred embodiment the invention relates 

30 to the host cell of the sixth aspect, where the inhibitory 
compound is UDP-galactose and preferably where the precursor is 
free galactose, preferably free D-galactose; even more 
preferably the precursor can be degraded to produce free 
galactose, or preferably free D-galactose; even more preferably 

35 the precursor is lactose, melibiose, raffinose, stachyose, 
verbascose or galactinol; yet even more preferably the medium 
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comprises an enzyme capable of degrading the precursor to 
produce free galactose, or preferably free D-galactose. 

A preferred embodiment of the invention relates to the 
host cell of the sixth aspect, where the host cell secretes an 
5 enzyme into the medium which is capable of degrading the 
precursor to produce free galactose, or preferably free D- 
galactose; more preferably the enzyme is a galactosidase, 
preferably an alf a-galactosidase or a beta-galactosidase. 

Another preferred embodiment of the invention relates to 

10 the host cell of the sixth aspect, wherein the amplification 
unit further comprises a nucleotide sequence of at least 100 
bp, preferably 200 bp, more preferably 3 00 bp, even more 
preferably 400 bp, and most preferably at least 500 bp with an 
identity of at least 70%, preferably 80%, more preferably 90%, 

15 even more preferably 95%, and most preferably at least 98% 
identity to a chromosomal nucleotide sequence of the host cell, 
A preferred embodiment of the invention relates to the 
host cell of the sixth aspect, wherein the nucleotide sequence 
comprised in the amplification unit is a partial non-functional 

20 copy of a conditionally essential gene of the host cell, 
wherein the host cell has had the conditionally essential gene 
rendered non functional by a partial deletetion, and wherein a 
recombination event between the partial copy of the gene 
comprised in the amplification unit and the partial chromosomal 

25 gene has restored a functional chromosomal gene; preferably the 
conditionally essential gene encodes a D-alanine racemase, 
preferably the conditionally essential gene is dal. 

Another preferred embodiment of the invention relates to 
the host cell of the sixth aspect, wherein the expressable copy 

30 of the chromosomal gene of the amplification unit has a reduced 
transcription level compared to the transcription level of the 
wild type gene of the host cell, preferably the transcription 
level is reduced with a factor of 100, preferably 50, more 
preferably 10, even more preferably 5, and most preferably with 

35 a factor of 2 . 
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Finally the invention provides a process for producing a 
polypeptide of interest, wherein the process comprises a step 
of cultivating a host cell as defined in any of the embodiments 
of the sixth aspect, 
5 Accordingly a preferred embodiment of the invention 

relates to the process of the final aspect, wherein the 
polypeptide is an enzyme such as a protease; a cellulase; a 
lipase; a xylanase; a phospholipase; or preferably an amylase.. 

Another preferred embodiment of the invention relates to 
10 the process of the final aspect, wherein the polypeptide is a 
hormone, a pro-hormone, a pre -pro -hormone, a small peptide, a 
receptor, or a neuropeptide. 



15 Introduction to Examples 

In order to use the galE gene as a marker in B. siibtilis, 
it . is necessary to delete the native galE gene on the 
chromosome. This mutant will be tested on different medias with 
and without galactose and glucose to confirm the phenotype. 

20 To enable an evaluation of the galE gene as an 

amplification marker, we decided to subclone the gene on an 
amplification vector comprising an AA560 amylase encoding gene 
as a reporter enzyme to determine the actual expression level 
of clones with single and multiple copies. Selection for 

25 multiple copies of the galE gene requires that the gene is 
expressed at a very low level. A weakly expressed galE gene 
will assure that only clones with many copies and sufficient 
expression of the epimerase will allow growth in the presence 
of galactose. The subduing of galE expression is done by 

30 siibcloning galE without expression signals downstream of the 
transcriptional terminator of the AA560 amylase gene. 
Transcription of galE is then dependant of the AA560 promoter 
and the very limited transcriptional read- through of the 
terminator. 

35 The amplification vector also comprises the C-terminal 

part of the dal gene which can complement a dal-minus B. 
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subtilis with a C-terminal deletion of the dal gene. 
Transformation of the dal -minus B. svibtilis with this 
amplification plasmid will enable direct selection for 
integration at the dal locus, when plated on media without D- 
alanine. 



Materials and Methods 

Strains and Donor Organisms 

Bacillus subtilis PL1801: This strain is a B. subtilis 
DN1885 which has disrupted apr and npr genes (Diderichsen, B et 
al. 1990. Cloning of aldB, which encodes alpha-acetolactate 
decarboxylase, an exoenzyme from Bacillus brevis. j. 
Bacterid., 172, 4315-4321). 

B. s\abtilis DN1886: This strain is a B. subtilis DN1885 
with a disrupted dal gene. 

B. subtilis PL1955: This strain is a B. subtilis PL1801 
carrying the plasmid pE194 which can deliver the RepF protein 
to support replication of replication-minus pE194 derivatives 
lacking the repF gene. 

B. subtilis MOL1794: This strain is a B. subtilis PL1801 
where the galE gene was replaced with a kanamycine resistance 
gene by use of the plasmid pMOL1748 (SEQ ID 1) . 

B. subtilis MOL1805: This strain is a DN1686 (dal-) strain 
where the galE gene was replaced with a kanamycine resistance 
gene. 

B. subtilis MOL1875: This strain is a MOL1805 where the 
kanamycine resistance gene gene was excised (dal-, galE-, no 
antibiotic markers) . 

Plasmids 

PMOL1748 (SEQ ID 1) : This plasmid is a pE194 derivative 
(Horinouchi, S and Weisblum, B., 1982, J. Bacterid. 150:804- 
814) essentially containing elements making the plasmid 



wo 01/90393 PCT/DKOl/00356 

36 

propagatable in Bacillus siibtilis, a kanamycin resistance gene, 
a gene conferring resistance to erythromycine, two flanking 
fragments from B. subtilis galE inserted upstream and 
downstream of the kanamycine resistance gene, two direct 
5 repeats that signify the res site from pAMpi and a fragment 
from pUBllO coding for the origin of transfer (McKenzie/ T. et 
al., 1986, Plasmid 15:93-103). This plasmid is used for 
deleting the galE gene in the B, subtilis strains PL1801 and 
DN1686. 

10 Table 1: pM0L1748 (6405 bp) 



Position 
(bp) 


Size (bp) 


Element (bp) 


Origin 


429-432 


4 


Linker 


Synthetic 


433-605 


173 


res site from 


E. 






pAMpl 


f aecalis 


606-978 


373 


Downstream 


B. 






galE seq 


subtilis 


979-1038 


60 


Linker 


Synthetic 


1039-4768 


3730 


pE194 


S . aureus 


4769-4779 


11 


Linker 


Synthetic 






sequence 




4780-5317 


538 


pUBllO 


S ■ aureus 


5318-5342 


25 


Linker 


Synthetic 


5343-5666 


324 


Upstream galE 


B. 






seq. 


subtilis 


5667-5685 


19 


Linker 


Synthetic 


5686-5858 


173 


res site from 


E. 






pAMPi 


f aecalis 


5859-5864 


6 


Linker 


Synthetic 


5865-428 


969 


pUBllO (Kan 


S . aureus 






gene) 





pMOL1807 (SEQ ID 2) and pMOL1809 (SEQ ID 3) : These 
plasmids are replication-minus pE194 derivatives (Horinouchi, S 
and Weisblum, B., 1982, J. Bacterid. 150:804-814) containing 

15 the origin of replication but lacking the repF gene coding for 
the replication protein. The repF deleted plasmid is totally 
dependant on replication protein delivered in trans from either 
a second plasmid or a chromosomally encoded repF gene in order 
to replicate. The plasmids codes for the kanamycine resistance 

20 gene, an alfa-amylase designated AA560, a promoterless galE 
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gene of B. subtilis, the C-terminal part of a dal gene for 
complementation of the dal-minus phenotype in DN1686 and 
derivatives thereof (such as M0L1875) . The alfa-araylase gene 
and the galE gene are transcriptionally fused in both plasmids 

5 but the PMOL1807 (SEQ ID 2) plasraid also has a transcriptional 
terminator located between the two genes, which only allows 
minor transcriptional read- through. These plasmids are used for 
integration and amplification studies in the dal locus of 
MOL1875. 

10 Table 2: pMOL1807 (5943 bp) 



Position 
(bp) 


Size (bp) 


Element (bp) 


Origin 


5-828 


824 


C-terminal dal 


B. 




sequence 


subtilis 


829-833 


5 


Linker 
sequence 


Synthetic 


834-2045 


1212 


pUBllO (Kana) 


S . aureus 


2046-2066 


21 


Linker 
sequence 


Synthetic 


2067-2316 


250 


pE194 (ori) 


S . aureus 


2317-2328 


12 


Linker 
sequence 


Synthetic 


2329-2884 


556 


pUBllO (oriT) 


S . aureus 


2885-2904 


20 


Linker 
sequence 


Synthetic 


2905-3167 


263 


amyL promoter 
and signal 
peptide 


B. 

lichenifo 
rmis 


3168-3176 


9 


Linker 
sequence 


Synthetic 


3177-4631 


1455 


□-amylase 
AA560 (NN5820) 


B. 

species 


4632-4660 


29 


Linker 
sequence 


Synthetic 


4661-4776 


116 


AmyL 

terminator 


B. 

lichenifo 
rmis 


4777-4803 


27 


Linker 
sequence 


Synthetic 


4804-5942 


1139 


galE 


B. 

subtilis 


5943-4 


5 


Linker 
sequence 


Synthetic 



Table 3: pMOL1809 (5793 bp) 
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Position 
(bp) 


Size (bp) 


Element (bp) 


Origin 


5-828 


824 


C- terminal dal 


B. 






sequence 


subtilis 


829-833 


5 


Linker 


Synthetic 






sequence 




834-2045 


1212 


pUBllO (Kana) 


S ■ aureus 


2046-2066 


21 


Linker 


Synthetic 






sequence 




2067-2316 


250 


pE194 (ori) 


S , aureus 


2317-2328 


12 


Linker 


Synthetic 






sequence 




2329-2884 


556 


pUBllO (oriT) 


S . aureus 


2885-2904 


20 


Linker 


Synthetic 






sequence 




2905-3167 


263 


amyL promoter 


B. 






and signal 


lichenifo 






peptide 


rmis 


3168-3176 


9 


Linker 


Synthetic 






sequence 




3177-4631 


14bb 


D- amylase 


a . 






AA560 (NN5820) 


species 


4632-4653 


22 


Linker 


Synthetic 






sequence 




4654-5792 


1139 


AmyL 


B. 






terminator 


lichenifo 








rmis 


5793-4 


5 


Linker 


Synthetic 






sequence 





pWT: a temperature sensitive, high copy number pAMpi der- 
5 ivative plasmid cottprising a gene coding for the resolvase 
enzyme from pAMbetal which can act on resolvase recognition 
sites (res) and an Erm resistance marker. 



Media 

10 TY (as described in Ausubel, F. M. et al. (eds.) "Current 

protocols in Molecular Biology". John Wiley and Sons, 1995) . 

LB agar (as described in Ausubel, F. M. et al . (eds.) 
"Current protocols in Molecular Biology". John Wiley and Sons, 
1995). LBP is LB agar supplemented with 0.05 M potassium 

15 phosphate, pH 7.0. LBPG is LB agar supplemented with 0.5% 
Glucose and 0.05 M potassium phosphate, pH 7.0. LBPSK is LB 
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agar supplemented with 0.05 M potassium phosphate, pH 7.0 and 
1% of skimmed milk. 

BPX media is described in EP 0 506 780 (WO 91/09129) . 

TSS agar (as described in Fouet A. and Sonenshein, A. L. 
5 (1990) A Target for Carbon Source -Dependant Negative Regulation 
of the citB Promoter of Bacillus subtilis. J, Bacterid., 172, 
835-844) . 

TSSara medium is TSS medium supplemented with 0.2% 
arabinose 

10 When appropriate, glucose was replaced with 0.5% galactose 

unless otherwise stated. For plates, 2% agar was added for 
solid media. For amylase phenotypic detection the plates were 
supplemented with 0.2% starch. When appropriate 10 mg/ml 
kanamycine was added. 

15 

Propagation of PL1801 strain. 

The Bacillus subtilis strain PL1801 was propagated in 
liquid medium 3 as specified by ATCC (American Type Culture 
Collection, USA) . After 18 hours incubation at 37°C and 300 
20 rpm, the cells were harvested, and genomic DNA was isolated by 
the method described below. 

Genomic DNA Preparation 

The Bacillus subtilis strain PL1801 was propagated in 
25 liquid media as described above. The cells were harvested, and 
genomic DNA was isolated by the method described by Pitcher et 
al . 1989, Rapid extraction of bacterial genomic DNA with guani- 
dium thiocyanate; Lett Appl Microbiol 8:151-156. 

30 General molecular biology methods 

Unless otherwise mentioned the DNA manipulations and 
transformations were performed using standard methods of 
molecular biology (Sambrook et al. 1989. Molecular cloning: A 
laboratory manual, Cold Spring Harbor lab.. Cold Spring Harbor, 

35 NY; Ausubel, F. M. et al . (eds.) "Current protocols in 
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Molecular Biology". John Wiley and Sons, 1995; Harwood, C. R. , 

and Cutting, S. M. (eds.) "Molecular Biological Methods for 

Bacillus". John Wiley and Sons, 1990) . 

Competent cells were prepared and transformed as 
5 described by Yasbin, R.E. et al. 1975, Transformation and 

transfection in lysogenic strains of Bacillus subtilis: 

evidence for selective induction of prophage in competent 

cells. J. Bacterid, 121:296-304. 

Enzymes for DNA manipulations were used according to the 
10 specifications of the suppliers (e.g. restriction 

endonucleases, ligases etc. are obtainable from New England 

Biolabs, Inc. ) . 

PCR reactions were performed using High Fidelity DNA 

Polymerase (Boeringer Mannheim) according to manufacturers 
15 instructions. The PCR reaction was set up in PCR buffer 

containing 200 (iM of each dNTP, 2.5 units of High Fidelity DNA 

Polymerase and 100 pmol of each primer. 

The PCR reactions were performed using a DNA thermal 

cycler PTC-200 (MJ Research) . One incubation at 94oC for 1 min 
20 followed by thirty cycles of PCR performed using a cycle 

profile of denaturation at 94oC for 10 sec, annealing at 60oC 

for 30 sec, and extension at 72oC for 2 min. Five-/il aliquots 

of the amplification product were analysed by electrophoresis 

in 0.7 % agarose gels (NuSieve, FMC) to verify a DNA fragment 
25 of the correct size. 

Fermentations 

Fermentations to evaluate amylase yields were performed in 
shakeflasks with 100 ml BPX at 300C, 300 rpm for five days. 
30 Culture volumes of 10 ml were harvested and centrifuged at 
10.000 g to remove cells and debris. The clear supemants were 
used for assaying alfa-amylase activity or were loaded on SDS 
gels. 



35 Assay for a-amylase activity 



10 
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Alfa-amylase activity was determined by a method emploing 
an enzymatic colorimetric test with 4, 6-ethylidene (G7) -p- 
nitrophenyl (Gl) -a,D-maltoheptaoside (ethylidene-G7PNP) as 
substrate (Boehringer Mannheim, Germany art. 1442309). Under a 
specified set of conditions (temp., pH, reaction time, buffer 
conditions) 1 mg of a given alfa-amylase will hydrolyse a 
certain amount of substrate and a yellow colour will be 
produced. The colour intensity is measured at 405 nm. The 
measured absorbance is directly proportional to the activity of 
the alfa-amylase in question under the given set of conditions. 



SDS-page 

SDS-page was performed on a Novex (Novex, San Diego) 
gradient Tricine 10-20% gel under denaturing conditions as 
15 prescribed by manufacturer. 



EXAMPLES 



20 Deletion of galE in B. subtilis 

A temperature sensitive plasmid was constructed for the 
purpose of deleting the galB gene in B. subtilis. Two flanking 
sequences upstream and downstream of the galE gene were 
amplified by PGR and inserted on each side of a kanamycine 

25 (Kan) marker in the plasmid which further comprised an 
erythromycins (Erm) resistance marker. The primer sequences 
used in the PGR amplifications are as follows: 

Upstream galE fragment: 
30 B5860H10 (SEQ ID 4) : TTACATCCGCGGGTGAGGAAAGACAGGAC 
B5860H11 (SEQ ID 5) : TAGTGAATTCAGAACCGGTCCACATCC 

Downstream galE fragment: 

181804 (SEQ ID 6) : TGTTCCCGAGAATGGAGGCCTTCTCAATTG 
35 181805 (SEQ ID 7) : TGGTTGTCGACATCTGAGGGAGGTACAATTGTAGCTG 
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The resulting plastnid pMOL1748 (SEQ ID 1) was transferred 
to B. subtilis PL1801 and plated on LBPG media with 5/xg/ml 
erytromycine (Erm) . The colonies were re -streaked twice on 
plates at 500C to select for integration of the plasmid at the 
5 galE locus. The clones were grown in plain TY at 330C over 4 
days to allow for excition and loss of the plasmid leaving the 
Kan marker in place of the galE gene. The strain MOL1794 was 
screened as being Kan resistant and Erm sensitive, 

A galE deletion strain designated MOL1794 was .tested on 

10 selective TSS minimal media supplemented with 0.2% galactose 
and 0.2% gluconate. The original B, subtilis PL1801 (galE+) 
strain showed fine growth on these plates while the galE- 
strain MOL 1794 showed no growth even after several days of 
incubation- On control TSS plates supplemented with 0.2% 

15 gluconate, both strains grew. The reported toxic effect of 
galactose on a galE- strain is therefore confirmed. 

The galE deletion was transferred to an isogenic D-alanine 
racemase negative (dal-) strain designated DN 1886 by simple 
chromosomal transformation and selection for transfer of the 

20 Kan resistance. A dal- galE- strain was isolated and designated 
MOL1805. 

The Kan resistance marker located in the galE locus of 
M0L1794 and MOL1805 was flanked by resolvase recognition sites 
(res) which allow a specific excision reaction in the presence 

25 of a resolvase. In order to remove the Kan marker from the 
chromosome, M0L1794 and MOL1805 were both transformed with pWT 
which is a temperature sensitive plasmid comprising a gene 
coding for resolvase and an Erm resistance marker. 
Transformants were selected on plates with 5/ig/ml Erm, they 

30 were tested for loss of the Kan marker and further re-streaked 
twice on plates with no antibiotics at 500C to cure the strains 
of the pWT plasmid. Selected clones were screened for loss of 
Erm resistance and Kan resistance and were designated MOL1875 
(DN1886, dal-, galE-; no antibiotic markers) and MOL1877 

35 (PL1801, galE-; no antibiotic markers). 
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Amplification plasmids 

Two different amplification plasmids with (pM0L1807; SEQ 
ID 2) and without (pMOLlSOS; SEQ ID 3) a transcriptional 
terminator between the AA560 amylase encoding gene and galE 
were constructed. The PCR-primers used for fragment 
amplification in the construction of the plasmids were as 
follows : 

C' terminal dal fragment: 

188502 (SEQ ID 8) : TTTTCATCGATACTAGTGTGCACGGATCCATCTGAAGGTCG 

ATACGGG 

188836 (SEQ ID 9) : TTGTTTGTCGACGCAAAGCTGTTTTATGAATTCTCC 
galE fragment primers: 

190694 (SEQ ID 10) : TTTTGGCCCAGCCGGCCAACAGGTCATTTTTTAGGAGGG 

190695 (SEQ ID 11) : TTATTGGATCCGTGAAAATCAAATAACAGCTAACAAGGG 
190697 (SEQ ID 12) : TTTTCATCGATAACAGGTCATTTTTTAGGAGGG 



Amplification experiments 

The two amplification plasmids pMOL1807 (SEQ ID 2) and 
pMOL1809 (SEQ ID 3) were introduced by transformation into 
MOL1875 (dal-, galE-) and plated on solid LBPA media (LB + 
phosphate + 0.2 % starch) without D-alanine to select for 
complementation of the dal phenotype. Transf ormants growing on 
these plates had integrated the plasmids into the dal locus and 
converted the dal- phenotype to dal+. All transf ormants showed 
clearing zones on the starch medium plates which indicated 
integration and expression of the AA560 amylase also. The site 
of integration was verified by PGR and the clones were re- 
streaked on TSSara minimal media both with and without 
galactose to study the galE expression. Clones with integration 
of pMOL1807 (SEQ ID 2) holding the terminator between the AA560 
amylase and the galE gene showed no growth on galactose plates. 
This phenotype demonstrated that a single copy of the 
artificial AA560-galE fusion in this construct did not express 
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sufficient GalE epimerase to remove the toxic UDP-galactose 
that was accumulated in the cells in the- presence of galactose. 
The other construction, pMOL1809 (SEQ ID 3) without a 
transcriptional terminator between the two genes showed some 
5 growth on TSS plates with galactose. From these results it was 
clear that pMOL1807 (SEQ ID 2) had the potential to be used as 
an amplification unit in the presence of galactose. 

The amplification procedure using galactose as the active 
agent can be performed in many different ways using both plates 

10 and broth cultures with different levels of galactose and other 
suger compounds or precursors from which free galactose can be 
released. We performed a number of different amplification 
procedures to evaluate their efficiency. The following table is 
a thorough description of the different amplification steps 

15 each transformant goes through before inoculation in a 
shakeflask (100 ml BPX) . The Kan marker makes it possible to 
amplify by using Kan in the traditional way and then to compare 
the amplification efficiency to the galactose method of the 
invention. 



# 


Amplification method 


KNU(T)/g 


1 


Transformant directly from LBPA 


2.54 


2 


Transformant directly from LBPA 


2.16 


3 


Transformant on LBPA, re-streaked 3 

X 


2.01 


4 


MOL1815 (single copy transformant) 


3.63 


5 


Transformant on LBPA 

>re-streaked on TSS + 0.2% ara + 

0.5% gal 


5.09 


6 


as # 5 + 2% gal in shakeflask 


4.53 


7 


Transformant on LBPA 

>2x(innoc. in liquid TSS + 0.2% ara 

+0.5% gal) 

>2x(re-streaked on TSS + 0.2% ara + 
0.5% gal) 


4.77 


8 


as # 8 + 0.5% gal in shakeflask 


5.66 


9 


Transformant on LBPA 

>re-streaked on TSS + 0.2% ara + 

0.5% gal 

>2x(iniioc. in liquid TY+ 0.5% gal) 
>2x (re-streaked on TSS + 0.2% ara + 
0.5% gal) 


7. 10 


10 


as # 9 + 0.5% gal in shakeflask 


2.09 
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11 


Trans formant on LBPA 

>re-streaked on TSS + 0.2% ara + 

0.5% gal 

>2x(innoc. in liquid TY+ 0.5% gal) 
>2x (re- streaked on LBPA) 


6.70 


12 


as # 11 + 0.5% gal in snaJcerxasK 


4 .35 


13 


Transformant on LBPA 
>re-streaked on TSS + u.-^^ ara -r 
0.5% gal , ^ 
>2x(innoc. m iiquia ii^ ouny/mj. 

Kan) 

>27e- streaKeo on jjjDiriiT ju/Ay/mx xvcu.* 
>re— 8 ureaKecjL oxi jj-pxrrv - 


7 .71 


14 


ji no J. on na/ml Kan in 

shakeflask 


11.60 


15 


as # 9 


6.65 


16 


as # 10 


5.16 


17 


as # 11 


12 .10 


18 


as # 12 


9.40 


19 


as # 13 


7 . 10 


20 


as w 14 


6.30 


21 


Transformant on TSSA + 0.2% ara + 
0.5% gal 

>2x (re- streaked on TSSA + 0.2% ara 
+ 0.5% gal) 


4.30 


22 


as # 21 + 0 - ga-L in snais-tii- J-ca&js. 


5 .60 


23 


as # 21 


2 .90 


24 


as # 22- 


5.00 


25 


Transformant on TSSA + 0.2% ara + 
2% gal 

>2x (re-streaked on TSSA + 0.2% ara 
+ 2% gal) 


3.60 


26 


as # 25 + 0.5% qal in shake£lasK 


5.80 


27 


it 25 


5.00 


28 


- as # 26 1^-^^ 



Table 4: The table shows the amplification method of 
individual clones and the actual amylase yields from a 5 day 
fermentation in 100 ml SKl-M medium at 300C. Some of the 
fermentations were performed in the presence of galactose or 
Kan to select for multiple copies during the fermentations. 
From the table it is obvious that amplification protocols 
using Kan or galactose in TY full broth show the highest 
yields (in bold). These results show that yield improvements 
by adding galactose is as efficient as using Kan. 
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The Southern blot in figure 1 shows samples from different 
strains either amplified by use of galactose or kanamycine or 
strains where no selection pressure is opposed. 

The results summarized herein show that it is indeed 
possible to increase the copy number of a chromosomally 
integrated expression cassette holding the galE gene by adding 
a simple suger compound such as galactose to the growth medium. 
The amplification potential, as judged from the band intensity 
on the Southern blots (figure 1) and the fermentation yields 
(table 4) , is very similar to what can be achieved by the 
traditional kanamycine antibiotic selection/amplification. 
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Claims number of copies of an 

1 A method for increasing the chromosome, 
anplification unit integrated into a host 
..erein the method comprises the stepB of: ^ ^^^^ ^^^^ 
, rendering a chro™ ^e- ^^^^^^^^^^^ 

functional, wherexn the host cell 
...ihitor. compound — ^ T^^^^^^ eo^^rising a 

when the host cell xs cultivated xn 

precursor; construct comprising an 

10 b) making . a nucleic ^^^n^ises- 

. wherein the unit comprises, 

amplification unit, wherein 

i) an expression cassette comprising at 

. ,ene of ^^ZV^ of the chromosomal gene of step 
U) an ejcpressable c^py ^^^^ ^^^^ 

^5 a) , wherein the unic xii a 

chromosome; . . ^ ^.q the 

,nt.oaa=in, .he nucleic ac« ~t of step 
.ost cell of step a, . »he«.n at least one J ^^^^ 
an^lification unit integrates .nto 

.;™i::tin. t. .st ceU Of step c, in^^a^^^- 

— crofC-i.ca:L----- 
..ltiplie. on the host ^ _ „sc^lly 

- e, selecting a Host ^^^^^^ optionally 

integrated copres J^/^ ^^^^^ „ e, using the 

^, performing °- ^ „e„ cycle, wherein 

host cell selected in step e) in 
the nun^er of chromosomlly Integrated copies 
a-nplifioation unit increases with each repeat. 

3. . n^thod for constructing a host cell ^'^^^^^^ ^^J^":^ 
one copy of an an^lif ication unit integrated into the 

X. ^ir, vv,P> method comprises the steps 
chromosome, wherein the mecn f ^^^^ 

3. a) rendering a chro^so^i^ g^e of a ^^^^^^^^^^ 
fxinctional, wherein the host ceii 
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inhibitory compound endogenously produced by the host cell 
when the host cell is cultivated in a medium comprising a 
precursor; 

b) making a nucleic acid construct comprising an 
5 anqplif ication unit, wherein the unit comprises: 

i) an expression cassette comprising at least one copy of 
a gene of interest; and 

ii) an expressable copy of the chromosomal gene of step 
a) , wherein the unit integrates into the host cell 

10 chromosome; 

c) introducing the nucleic acid construct of step b) into the 
host cell of step a) and cultivating the host cell in a 
medium comprising the precursor, wherein at least one copy 
of the amplification unit integrates into the host cell 

15 chromosome ; and 

d) selecting a host cell comprising at least one 
chromosomally integrated copy of the amplification unit. 

3 ■ A method for increasing the number of copies of an 
20 amplification unit integrated into a host cell chromosome, 
wherein the method comprises the steps of: 

a) providing a host cell, wherein a chromosomal gene has been 
rendered non- functional, whereby the host cell becomes 
susceptible to an inhibitory compound endogenously produced 

25 by the host cell when the host cell is cultivated in a 

medium comprising a precursor; 

b) introducing a nucleic acid construct into the host cell of 
step a) , the nucleic acid construct comprising an 
amplification unit, wherein the unit comprises: 

30 i) an expression cassette comprising at least one copy of 

a gene of interest; and 
ii) an expressable copy of the chromosomal gene of step 
a), 

wherein at least one copy of the amplification unit 
35 integrates into the host cell chromosome; 
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c) cultivating the host cell of step b) in a medium 
comprising the precursor, wherein a chromosomally 
integrated copy of the amplification unit is duplicated or 
multiplied on the host cell chromosome; 

5 d) selecting a host cell comprising two or more chromosomally 
integrated copies of the amplification unit; and optionally 
e) performing one or more cycles of steps c) and d) using the 
host cell selected in step d) in each new cycle; wherein 
the number of chromosomally integrated copies of the 

10 amplification unit increases with each cycle. 

4. The method of any of claims 1-3, wherein the host cell is 
a Gram-positive bacterial cell, preferably a Bacillus cell, 
more preferably a Bacillus cell of a species chosen from the 

^s group consisting of Bacillus alkalophilus, Bacillus 
amyloliquefaciens. Bacillus brevis. Bacillus circulans. 
Bacillus clausii. Bacillus coagulans. Bacillus lautus, Bacillus 
lentus, Bacillus lichenif ormis. Bacillus megaterium. Bacillus 
stearothermophilus. Bacillus subtilis, and Bacillus 

20 thuringiensis; and most preferably a Bacillus lichenif ormis 
cell. 

5 The method of any of claims 1-4, wherein the chromosomal 
gene of step a) encodes an enzyme, preferably chosen from the 
25 group consisting of galactokinase (EC 2.7.1.6). UTP-dependent 
pyrophosphorylase (EC 2 . 7 . 7 . 10) , UDP-glucose-dependent 
uridylyltransf erase (EC 2.7.7.12), UDP-galactose epimerase (EC 
5.1.2.3). 

30 6. The method of any of claims 1-4, wherein the chromosomal 
gene of step a) encodes an enzyme with UDP-galactose epimerase 
activity (EC 5.1.2.3). - 

7. The method of any of claims 1-4, wherein the chromosomal 

35 gene of step a) is galE. 
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8, The method of any of claims 1-7, wherein the inhibitory 
compound is UDP-galactose . 

9. The method of any of claims 1-8, wherein the precursor is 
5 free galactose, preferably free D-galactose, 

dO. The method of any of claims 1-8, wherein the precursor 
can be degraded to produce free galactose, or preferably free 
D-galactose. 

10 

11. The method of any of claims 1-8, wherein the precursor .is 
lactose, melibiose, raffinose, stachyose, verbascose or 
galactinol . 

15 12. The method of any of claims 1 - 8, wherein the medium 
comprises an enzyme capable of degrading the precursor to 
produce free galactose, or preferably free D-galactose. 

13. The method of any of claims 1-8, wherein the host cell 
20 secretes an enzyme into the medium which is capable of 

degrading the precursor to produce free galactose, or 
preferably free D-galactose. 

14. The method of claims 12 or 13, wherein the enzyme is a 
25 galactosidase, preferably an alf a-galactosidase or a beta- 

galactosidase - 

15. The method of any of claims 1-14, wherein the nucleic 
acid construct is a plasmid. 

30 

16. The method of any of claims 1-15, wherein the nucleic 
acid construct further comprises an antibiotic selection 
marker, preferably flanked by by resolvase sites or res-sites. 

35 17. The method of any of claims 1 - 15, wherein the 
amplification unit further comprises a nucleotide sequence with 
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a homology to a chromosomal nucleotide sequence of the host 
cell sufficient to effect chromosomal integration in the host 
cell of the amplification unit by homologous recombination. 

5 18. The method of any of claims i - 15, wherein the 
amplification unit further comprises a nucleotide sequence of 
at least 100 bp, preferably 200 bp, more preferably 300 bp 
even more preferably 400 bp, and most preferably at least 500 
bp with an identity of at least 70%, preferably 80%, more 
10 preferably 90%, even more preferably 95%, and most preferably 
at least 98% identity to a chromosomal nucleotide sequence of 
the host cell. 

19. The method of claims 17 or 18, wherein the nucleotide 
15 sequence comprised in the amplification unit is a partial non- 
functional copy of a conditionally essential, gene of the host 
cell, wherein the host cell prior to the first step of the 
invention has had the conditionally essential gene rendered non 
functional by a partial deletetion, and wherein a recombination 
20 event between the partial copy of the gene comprised in the 
amplification unit and the partial chromosomal gene restores a 
functional chromosomal gene. 

20. The method of claim 19, wherein the conditionally essential 
25 gene encodes a D-alanine racemase, preferably the conditionally 

essential gene is dal . 

21. The method of claim 19 or 20, wherein a first amplification 
unit integrates into the host cell chromosome by homologous 

30 recombination with the partially deleted conditionally 
essential gene and renders the gene functional. 

22. The method of any of claims 1 - 21, wherein the 
amplification unit further comprises an antibiotic marker, 

35 preferably flanked by resolvase sites or res-sites. 
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23. The method of claim 22, wherein a host cell comprising a 
first chromosomally integrated amplification unit is selected 
and the antibiotic marker excised from the host cell chromosome 
by a resolvase prior to the next step in the method. 

24. The method of any of claims 1-23, wherein the gene of 
interest encodes an polypeptide of interest . 

25. The method of claim 24, wherein the polypeptide is an 
enzyme such as a protease; a cellulase; a lipase; a xylanase; a 
phospholipase; or preferably an amylase. 

26. The method of claim 24, wherein the polypeptide is a 
hormone, a pro-hormone, a pre-pro-hormone, a small peptide, a 
receptor, or a neuropeptide. 

27. The method of any of claims 1 - 26, wherein the expressable 
copy of the chromosomal gene comprised in an amplification unit 
integrated in the host cell chromosome has a reduced 
transcription level compared to the transcription level of the 
wild type gene of the host cell, preferably the transcription 
level is reduced with a factor of 100, preferably 50, more 
preferably 10, even more preferably 5, and most preferably with 
a factor of 2 . 

28. The method of any of claims 1-27, wherein the expressable 
copy of the chromosomal gene comprised in the amplification 
unit is promoterless . 

29. The method of any of claims 1-28, wherein the expressable 
copy of the chromosomal gene comprised in the amplification 
unit has a transcription terminator located upstream of the 
gene . 

30. The method of any of claims 1-29, wherein the gene of 
interest is located upstream of the expressable copy of the 
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chromosomal gene within the amplification unit and wherein the 
two genes are co-direct ionally transcribed. 

31. The method of claim 30, wherein the expressable copy of the 
chromosomal gene is expressed by read-through transcription 
from the gene of interest. 

32. An amplification unit comprising: 

a) an expression cassette comprising at least one copy of a 
gene of interest; and 

b) an expressable copy of a conditionally essential 
chromosomal gene of a host cell; wherein the unit 
integrates into the host cell chromosome upon introduction 
of the nucleic acid construct into the host cell. 

33. The unit of claim 32, wherein the chromosomal gene encodes 
an enzyme, preferably chosen from the group consisting of 
galactokinase (EC 2.7.1.6), UTP-dependent pyrophosphorylase (EC 
2.7.7.10), UDP-glucose-dependent uridylyltransf erase (EC 
2 . 7 . 7 . 12 ) , UDP-galactose epimerase (EC 5.1,2.3). 

34. The unit of claims 32 or 33, wherein the chromosomal gene 
encodes an enzyme with UDP-galactose * epimerase activity (EC 
5.1.2.3) . 

35. The unit of claims 32 or 33, wherein the chromosomal gene 
is galE. 

36. The unit of any of claims 32 - 35, wherein the gene of 
interest encodes an polypeptide of interest. 

37. The unit of claim 36, wherein the polypeptide is an enzyme 
such as a protease; a cellulase; a lipase; a xylanase; a 
phospholipase; or preferably an amylase. 
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38. The unit of claim 36, wherein the polypeptide is a hormone, 
a pro-hormone, a pre-pro-hormone, a small peptide, a receptor, 
or a neuropeptide. 

39. The unit of any of claims 32 - 38, wherein the expressable 
copy of the chromosomal gene is promoterless • 

40. The unit of any of claims 32 - 39, wherein the expressable 
copy of the chromosomal gene has a transcription terminator 
located upstream of the gene, 

41. The unit of any of claims 32 - 40, wherein the gene of 
interest is located upstream of the expressable copy of the 
chromosomal gene and wherein the two genes are co-directionally 
transcribed. 

42. The unit of claim 41, wherein the expressable copy of the 
chromosomal gene" is expressed by read-through transcription 
from the gene of interest. 

43. The unit of any of claims 32 - 42, which further comprises 
an antibiotic marker, preferably flanked by resolvase sites or 
res-sites . 

44. A nucleic acid construct comprising a unit as defined in 
any of claims 32 - 43 . 

45. A host cell wherein a chromosomal gene has been rendered 
non- functional leaving the host cell susceptible to an 
inhibitory compound endogenously produced by the host cell when 
cultivated in a medium comprising a precursor; and wherein the 
host cell comprises an amplification unit as defined in any of 
claims 32 - 43 or a nucleotide construct as defined in claim 
44. 
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46. The host cell of claim 45, wherein the host cell is a Gram- 
positive bacterial cell, preferably a Bacillus cell, more 
preferably a Bacillus cell of a species chosen from the group 
consisting of Bacillus alkalophilus , Bacillus 
amyloliquefaciens. Bacillus brevis. Bacillus circulans. 
Bacillus clausii. Bacillus coagulans, Bacillus lautus, Bacillus 
lentus. Bacillus licheniformis. Bacillus megaterium. Bacillus 
stearothermophilus. Bacillus subtilis, and Bacillus 
thuringiensis; and most preferably a Bacillus licheniformis 
cell. 

47. The host cell of claims 45 or 46, wherein the chromosomal 
gene encodes an enzyme, preferably the enzyme is chosen from 
the group of enzymes consisting of galactokinase (EC 2.7.1.6), 
UTP-dependent pyrophosphorylase (EC 2.7.7.10), UDP-glucose- 
dependent uridylyltransf erase (EC 2.7.7.12), UDP-galactose 
epimerase (EC 5.1.2.3), more preferably the enzyme is an UDP- 
galactose epimerase (EC 5.1.2.3), and most preferably the 
enzyme . is encoded by galE . 

48. The host cell of any of claims 45 - 47, where the 
inhibitory compound is UDP-galactose. 

49. The host cell of any of claims 45-48, where the precursor 
is free galactose, preferably free D-galactose. 

50. The host cell of any of claims 45 - 48, where the precursor 
can be degraded to produce free galactose, or preferably free 
D-galactose . 

51. The host cell of any of claims 45 - 48, where the precursor 
is lactose, melibiose, raffinose, stachyose, verbascose or 
galactinol . 
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52. The host cell of any of claims 45 - 51, where the medium 
comprises an enzyme capable of degrading the precursor to 
produce free galactose, or preferably free D-galactose. 

53. The host cell of any of claims 45-52, where the host cell 
secretes an enzyme into the medium which is capable of 
degrading the precursor to produce free galactose, or 
preferably free D-galactose. 

54. The host cell of claims 52 or 53, where the enzyme is a 
galactosidase, preferably an alf a-galactosidase or a beta- 
galactosidase . 

55. The host cell of any of claims 45 - 54, wherein the 
amplification unit further comprises a nucleotide sequence of 
at least 100 bp, preferably 200 bp, more preferably 300 bp, 
even more preferably 400 bp, and most preferably at least 500 
bp with an identity of at least 70%, preferably 80%, more 
preferably 90%, even more preferably 95%, and most preferably 
at least 98% identity to a chromosomal nucleotide sequence of 
the host cell. 

56. The host cell of claim 55 wherein the nucleotide sequence 
comprised in the amplification unit is a partial non- functional 
copy of a conditionally essential gene of the host cell, 
wherein the host cell has had the conditionally essential gene 
rendered non functional by a partial deletetion, and wherein a 
recombination event between the partial copy of the gene 
comprised in the amplification unit and the partial chromosomal 
gene has restored a functional chromosomal gene. 

57. The host cell of claim 56 wherein the conditionally 
essential gene encodes a D-alanine racemase, preferably the 
conditionally essential gene is dal. 
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58. The host cell of any of claims 45 - 57 wherein the 
expressable copy of the chromosomal gene of the amplification 
unit has a reduced transcription level compared to the 
transcription level of the wild type gene of the host cell, 
preferably the transcription level is reduced with a factor of 
100, preferably 50, more preferably 10, even more preferably 5, 
and most preferably with a factor of 2. 

59. A process for producing a polypeptide of interest, wherein 
the process comprises a step of cultivating a host cell as 
defined in any of claims 45-58. 

60. The process of claim 59 wherein the polypeptide is an 
enzyme such as a protease; a cellulase; a lipase; a xylanase; a 
phospholipase; or preferably an amylase. 

61. The process of claim 59 wherein the polypeptide is a 
hormone, a pro-hormone, a pre -pro -hormone, a small peptide, a 
receptor, or a neuropeptide. 
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Southern analysis on different ampMed clones. 

Numbers inparenlhesis refer to the clone ninnbersmtaU^ 1. 
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Marker, Lambda-HindHI digest 
PL1801 

Two copy strain 
Singlecopy strain (#2) 
Multicopy by galactose (#9) 
Multicopy by kanamycme (#13) 
Multicopy by kanamycine (#14) 
Multicopy by galactose (#15) 
Multicopy by galactose (#17) 
Multicopy by kanamycine (#19) 
Multicopy by galactose (#21) 
Multicopy by galactose (#25) 
Multicopy by galactose (#27) 
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SEQUENCE LISTING 

<110> Novo Nordiak A/S 

<120> Method for Increasing Gene Copy Number 

<130> 10028. 204-WO 

<140> 
<141> 

<160> 12 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 6405 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pMOL1748 
<400> 1 

ctaaatcggt agaagcccaa acgttccacg atgcgatttg tgcccttatc gtagaagagc 60 
tgtttgaata tgcaggcaaa tggcgtaata ttcgtgtgca aggaccgaca acatttctac 120 
catccttgac tgtacaggta gcaatggcag gtgccatgtt gattggtctg catcatcgca 180 
tctgttatac gacgagcgct tcggtcttaa ctgaagcagt taagcaatca gatcttcctt 240 
caggttatga ccatctgtgc cagttcgtaa tgtctggtca actttccgac tctgagaaac 300 
ttctggaatc gctagagaat ttctggaatg ggattcagga gtggacagaa cgacacggat 360 
atatagtgga tgtgtcaaaa cgcataccat tttgaacgat gacctctaat aattgttaat 420 
catgttggag ctcagtgaga gcgaagcgaa cacttgattt tttaattttc tatcttttat 480 
aggtcattag agtatactta tttgtcctat aaactattta gcagcataat agatttattg 540 
aataggtcat ttaagttgag catattagag gaggaaaatc ttggagaaat atttgaagaa 600 
cccgagaatg gaggccttct caattgagaa ggcctttttt aaagaacaag ggtgcctaaa 660 
caggcaccct tgttagctgt tatttgattt tcacaataac atcattactg aattttagtt 720 
tccaagtgcc ttttgcataa gcttccttgt caacttcaaa tgcttttaca cctgttactt 780 
taatattagg atttagatca ctcaaaattt tagagttatc aacttttgtc tcagttgcat 840 
agtttacaga agcatcaata tcagaatcat aagaagtacc atcagcatca actaatttaa 900 
cagttggaat tgaaaaagag ctaatcggct ttttagatac gtttttaatt gtatattgaa 960 
cagctacaat tgtacctcag cggcgcagcg ggtcgacgcg gccgcaacca tttgatcaaa 1020 
gcttgcatgc ctgcaggtcg attcacaaaa aataggcaca cgaaaaacaa gttaagggat 1080 
gcagtttatg catcccttaa cttacttatt aaataattta tagctattga aaagagataa 1140 
gaattgttca aagctaatat tgtttaaatc gtcaattcct gcatgtttta aggaattgtt 12 00 
aaattgattt tttgtaaata ttttcttgta ttctttgtta acccatttca taacgaaata 1260 
attatacttt tgtttatctt tgtgtgatat tcttgatttt tttctactta atctgataag 1320 
tgagctattc actttaggtt taggatgaaa atattctctt ggaaccatac ttaatataga 1380 
aatatcaact tctgccatta aaagtaatgc caatgagcgt tttgtattta ataatctttt 1440 
agcaaacccg tattccacga ttaaataaat ctcattagct atactatcaa aaacaatttt 1500 
gcgtattata tccgtactta tgttataagg tatattacca tatattttat aggattggtt 1560 
tttaggaaat ttaaactgca atatatcctt gtttaaaact tggaaattat cgtgatcaac 1620 
aagtttattt tctgtagttt tgcataattt atggtctatt tcaatggcag ttacgaaatt 1680 
acacctcttt actaattcaa gggtaaaatg gccttttcct gagccgattt caaagatatt 1740 
atcatgttca tttaatctta tatttgtcat tattttatct atattatgtt ttgaagtaat 1800 
aaagttttga ctgtgtttta tatttttctc gttcattata accctcttta atttggttat 1860 
atgaattttg cttattaacg attcattata accacttatt ttttgtttgg ttgataatga 1920 
actgtgctga ttacaaaaat actaaaaatg cccatatttt ttcctcctta taaaattagt 1980 
ataattatag cacgagctct gataaatatg aacatgatga gtgatcgtta aatttatact 2040 
gcaatcggat gcgattattg aataaaagat atgagagatt tatctaattt cttttttctt 2100 
gtaaaaaaag aaagttctta aaggttttat agttttggtc gtagagcaca cggtttaacg 2160 
acttaattac gaagtaaata agtctagtgt gttagacttt atgaaatcta tatacgttta 2220 
tatatattta ttatccggag gtgtagcatg tctcattcaa ttttgagggt tgccagagtt 2280 
aaaggatcaa gtaatacaaa cgggatacaa agacataatc aaagagagaa taaaaactat 2340 
aataataaag acataaatca tgaggaaaca tataaaaatt atgatttgat taacgcacaa 2400 

1 
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^aat^f^ ^^^^""^ aattgatgaa acgattgatg agaattattc agggaaacgt 2460 
StStaaff h«fo^^°^'' tcgacatgtg gacggaetgg ttacaagtga tlllgattfc 2520 
tttgatgatt taagcggaga agaaatagaa cgatttttta aagatagctt ggaqtttcta 2580 

SSc l^tlt^tl t^T^?n^ tatgogactg tcfatc?gga gSgaS lllo 
ccacacatgc actttggttt tgtcccttta acagaggacg ggagattqtc tacaaaaoaa ?70n 
cagttaggca acaagaaaga ctttactcaa ttacaa^tl Utttaataa gtatSga^ llin 
gagaaaggtt atgaacttga aagaggcacg tccaaagagg ttacag^aS agaacataaa 282? 
gcgat^atc agtacaagaa agatactgta tttcatlall aggaaltgS aSaStaS lllo 
S^IcS^t^ Iff^^''^^ taagcagtta cagagtggaa tgagcatat gg^Sg llTo 
a^ccctttg attatgaaaa tgagogtaca ggtttgttct ctggacgtga agagactqqt 3000 
agaaagatat taactgctga tgaatttgaa cgcctgcaag aalSalfctc ttctaca^a lola 
tSaS^S ^tall^"^^ aaatabtaag agcacfgac? attacacaga aaatlSgS 312? 

gtagagagag tttgaaagaa gtagtgaata catggaaaga ggggtatcac 3180 
aagaggttaa taaattaaag cgagagaatg ataifttgia ?£|cagSg 32J0 
fcSrS^S^? agaaatttca agctagtaca gtgactttat atcitgctgc gaggSS tlto 
ttccctgggt ttgagaaagg gtttaatagg cttaaagaga aattctttaa tgattccaS 33S0 

aaoc^S^ tgggacagtt tatggatgtt gtacaggata atgtccagaa ggcgaSS 342? 

aagcgtgaga aacagcgtac agacgattta gagatgtaga ggtactttta tqccfaaaaa Ilia 

actttttgcg tgtgacagtc cttaaaatat acttagaglg ffagcgaaag tfS^Scqac 35?? 

agctattaac tttcggtttc aaagctctag gatttttlaf ggacglagS ca?SSSqc 360? 

aaaaaggaaa ttggaataaa tgcgaaattt gagatgttaa tlaalgafS SttqaqSc 366? 

cccccSS tSciS^ Tlfl^^^ gggaglaaac atagg^ggt aSglcct 37"? 

tt^^aa?t ^^ttltf^ ccattgtcca aacaaataaa taaatattgg gtttttaatg 3780 

tcaaaaggtt gttttttatg ttaaagtgaa aaaaacagat gttgggaggt acaatoataa 3BAft 

l^.attlff ^^^^9^^9f9 aaaaaagttg ctgttac?tt laglStaS acagaSaS 390? 

tStf^^^^^ aaatagaatc aaagaaaaat ataatattag caaatcagat gcalccSS 396? 

aao^n^^ aaaatatgca aaggaggaat acggtgcatt ttaaacaaaa faagatSac 402? 

t^^J^^^ tgctgcctat ctatgactaa attttgttaa gtgtattagc acoittaSI 4?8? 

tatcatg^gc gaaaatgtaa taaaagaaac tgaaaacaag aaaaattcL gaggaStS 414? 

gScaca" SaStataa t^T'^T. ^^^^^^ag tggttagagt It?ga£S 420? 
gtcacacact caatttgtag tgtctccatt acatgatagg gatacttrata caaaaaahSa A^t^n 

SSSata ItlacaaSo aSf^'?^ SatgLtgS LtaatSat ctfSSaS tltl 
^^ff^l^ attacagaag aattgaatgc gactattccg cagattgcag gaagtgtgaa 4380 
aggtcttgtg agatatatgc ttcacatgga cgatcctaat aaatttaaat atclaiaaaa 444? 
tfStSaaa ?t2ttff'^ ^tgtagatgt tgatgaatta ttaaaga^a Saacaga «o? 
tagatataaa ttaattaaag aaatgattga gtttattgat gaacaaggaa tcotaaaatt asko 
taagagttta atggattatg caatgaagtt taaatttiat gattgg?fcc cgrtt?tSg tllo 
Saatt^ta Itfltl^T f^^^^a^^*^^ tataaaatca latclitata aatc^gacc^ 468? 
atagattttg aatttaggtg tcacaagaca ctcttttttc gcaccagcga aaactoattt 474o 
acfSSi t?cScaIa^ -taatcgact ctagaggatc ^tttta^c agctglfttc tltl 
actttttgca ttctacaaac tgcataactc atatgtaaat cgctcctttt tacmtaQcac 4860 
acS?a?Sf cataaa^^ct 2"?°?^°" accacttcca a^taaagtat aallclltat till 
aaaccStaa aaSt^?^?^ aggctgtcgg cagtgccgac caaaaccata 4980 

n^^^l ■ Sr^^^'^ttctt ttttttacga gaaaaaagaa acaaaaaaac ctgccctctg 5040 
ao^»^^?^?^ ^^3999^t tttgctctcg tgctcgttta aaaatcagca a^gacaggl llOO 
gagaagatca ctcaaaaaat ctccaccttt aaacccttgc c^tttttat 516? 
"tSaoSt taccgaaagc cagactcagc aagaataala tttttattgt 522? 

ctctcStc ttttSS^« cggacaaaac cactcaaaat aaaaaagata caagagagit 5280 
ctctcgtatc ttttattcag caatcgcgcc cgattgctga acagattaat aatgaqccqc 5340 
aaZtaT.r ^^f''^^^'''' tgatgataca agggcaaaac agctttgctt caccgctt|c 54S? 

^^'r^^^^ tgattcacca gtattgcggg ccgacaccgc ctgacaagfa 5460 
^tStSS ^tnll^^"^^ tctatgcttt agatgctgag ctgaatctlo agccgggc?t 552? 
aJtact™ taa^^t^'''' aagaaatgaa agagcacatt cttgctgaaa cctctltcga 5580 
^^^ff^ff^^ t^f^^^^^^ ctaaaaaata tgaaaaaact attaataaac gattaaactt 5640 
cttaaaaatg gatgtggacc ggttctgaat tctgatcaaa tggttcagtg agagcgaaqc 5700 
tataaactft t^aqS^'." ttctatcttt tataggtcat tSgtaSI t^^^lt TlSO 
aa^aa™ lllfrlf^^^ aatagattta ttgaataggt catttaagtt gagcatatta 5820 
f™f^h aatatttgaa gaacccgaac gcgtgagtag ttcaacaaac 5880 

gggccagttt gttgaagatt agatgctata attgttatta aaagiattga aggatgctta 5940 
Iaqcl^a?c? ctgaataaga acggtgctct ccalltaJtc t?l£?agaa SJ? 

ctfa^^«oa 9aaaagggaa tgagaatagt gaatggacca ataataaEga 6060 

ctagagaaga aagaatgaag attgttcatg aaattaagga acqaatatta aataaatata 6120 
Satit Sa^f f^^^^^-^^ gctctctti tc^tcSac? SSSccc? tllo 
attcggatat tgagatgatg tgtgtcatgt caacagagga agcagagttc agccatgaat 6240 
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atgcatctca ggtggaatca . gattggccgc 
cgatttatga ttcaggtgga tacttagaga 
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attttgatag cgaagagatt ctactagatt 6300 
ttacacatgg tcaatttttc tctattttgc 6360 
aagtgtatca aactg 6405 



<210> 2 
<211> 5943 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pMOL1807 
<400> 2 

gatccatctg aaggtcgata cggggatgaa cagacttggt gtaaaaacag aggaagaagt 60 
tcagaacgtg atggcaattc ttgaccgcaa ccctcgttta aagtgcaaag gggtatttac 120 
ccattttgcg acagcggatg aaaaagaaag aggctatttc ttaatgcagt ttgagcgctt 180 
taaagagctg attgctccgc tgccgttaaa gaatctaatg gtccactgcg cgaacagcgc 240 
cgctggactc cggctgaaaa aaggcttttt taatgcagtc agattcggca tcggcatgta 300 
tggccttcgc ccgtctgctg acatgtcgga cgagataccg tttcagctgc gtccggcatt 360 
taccctgcat tcgacactgt cacatgtcaa actgatcaga aaaggcgaga gcgtcagcta 420 
cggagccgag tacacagcgg aaaaagacac atggatcggg acggtgcctg taggctatgc 480 
ggacggctgg ctccgaaaat tgaaagggac cgacatcctt gtgaagggaa aacgcctgaa 540 
aattgccggc cgaatttgca tggaccaatt tatggtggag ctggatcagg aatatccgcc 600 
gggcacaaaa gtcacattaa taggccggca gggggatgaa tatatttcca tggatgagat 660 
tgcaggaagg ctcgaaacca ttaactatga ggtggcctgt acaataagtt cccgtgttcc 720 
ccgtatgttt ttggaaaatg ggagtataat ggaagtaaga aatcctttat tgcaggtaaa 780 
tataagcaat taacttacct aaatggagaa ttcataaaac agctttgcgt cgacgatgaa 840 
gatggatttt ctattattgc aatgtggaat tgggaacgga aaaattattt tattaaagag 900 
tagttcaaca aacgggccag tttgttgaag attagatgct ataattgtta ttaaaaggat 960 
tgaaggatgc ttaggaagac gagttattaa tagctgaata agaacggtgc tctccaaata 1020 
ttcttattta gaaaagcaaa tctaaaatta tctgaaaagg gaatgagaat agtgaatgga 1080 
ccaataataa tgactagaga agaaagaatg aagattgttc atgaaattaa ggaacgaata 1140 
ttggataaat atggggatga tgttaaggct attggtgttt atggctctct tggtcgtcag 1200 
actgatgggc cctattcgga tattgagatg atgtgtgtca tgtcaacaga ggaagcagag 1260 
ttcagccatg aatggacaac cggtgagtgg aaggtggaag tgaattttga tagcgaagag 1320 
attctactag attatgcatc tcaggtggaa tcagattggc cgcttacaca tggtcaattt 1380 
ttctctattt tgccgattta tgattcaggt ggatacttag agaaagtgta tcaaactgct 1440 
aaatcggtag aagcccaaac gttccacgat gcgatttgtg cccttatcgt agaagagctg 1500 
tttgaatatg caggcaaatg gcgtaatatt cgtgtgcaag gaccgacaac atttctacca 1560 
tccttgactg tacaggtagc aatggcaggt gccatgttga ttggtctgca tcatcgcatc 1620 
tgttatacga cgagcgcttc ggtcttaact gaagcagtta agcaatcaga tcttccttca 1680 
ggttatgacc atctgtgcca gttcgtaatg tctggtcaac tttccgactc tgagaaactt 1740 
ctggaatcgc tagagaattt ctggaatggg attcaggagt ggacagaacg acacggatat 1800 
atagtggatg tgtcaaaacg cataccattt tgaacgatga cctctaataa ttgttaatca 1860 
tgttggttac gtatttatta acttctccta gtattagtaa ttatcatggc tgtcatggcg 1920 
cattaacgga ataaagggtg tgcttaaatc gggccatttt cgctaataag aaaaaggatt 1980 
aattatgagc gaattgaatt aataataagg taatagattt acattagaaa atgaaagggg 2040 
attttgcggc cgccaacctc gagatctctt agatttttgg ggttatttag gggagaaaac 2100 
ataggggggt actacgacct cccccctagg tgtccattgt ccattgtcca aacaaataaa 2160 
taaatattgg gtttttaatg ttaaaaggtt gttttttatg ttaaagtgaa aaaaacagat 2220 
gttgggaggt acagtgatgg ttgtagatag aaaagaagag aaaaaagttg ctgttacttt 2280 
aagacttaca acagaagaaa atgagatatt aaataggaat tcgagctcat tattaatctg 2340 
ttcagcaatc gggcgcgatt gctgaataaa agatacgaga gacctctctt gtatcttttt 2400 
tattttgagt ggttttgtcc gttacactag aaaaccgaaa gacaataaaa attttattct 2460 
tgctgagtct ggctttcggt aagctagaca aaacggacaa aataaaaatt ggcaagggtt 2520 
taaaggtgga gattttttga gtgatcttct caaaaaatac tacctgtccc ttgctgattt 2580 
ttaaacgagc acgagagcaa aacccccctt tgctgaggtg gcagagggca ggtttttttg 2640 
tttctttttt ctcgtaaaaa aaagaaaggt cttaaaggtt ttatggtttt ggtcggcact 2700 
gccgacagcc tcgcagagca cacactttat gaatataaag tatagtgtgt tatactttac 2760 
ttggaagtgg ttgccggaaa gagcgaaaat gcctcacatt tgtgccacct aaaaaggagc 2820 
gatttacata tgagttatgc agtttgtaga atgcaaaaag tgaaatcagc tggactaaaa 2880 
ggcagagctc ggtacccggg agctctatca attggtaact gtatctcagc ttgaagaagt 2940 
gaagaagcag agaggctatt gaataaatga gtagaagcgc catatcggcg cttttctttt 3000 
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ggacaaccgg tgagtggaag gtggaagtga 
atgcatctca ggtggaatca gattggccgc 
cgatttatga ttcaggtgga tacttagaga 



10028 

attttgatag cgaagagatt ctactagatt 6300 
ttacacatgg tcaatttttc tctattttgc 6360 
aagtgtatca aactg 6405 



<210> 2 
<211> 5943 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pMOL1807 
<400> 2 

gatccatctg aaggtcgata cggggatgaa cagacttggt gtaaaaacag aggaagaagt 60 
tcagaacgtg atggcaattc ttgaccgcaa ccctcgttta aagtgcaaag gggtatttac 120 
ccattttgcg acagcggatg aaaaagaaag aggctatttc ttaatgcagt ttgagcgctt 180 
taaagagctg attgctccgc tgccgttaaa gaatctaatg gtccactgcg cgaacagcgc 240 
cgctggactc cggctgaaaa aaggcttttt taatgcagtc agattcggca tcggcatgta 300 
tggccttcgc ccgtctgctg acatgtcgga cgagataccg tttcagctgc gtccggcatt 360 
taccctgcat tcgacactgt cacatgtcaa actgatcaga aaaggcgaga gcgtcagcta 420 
cggagccgag tacacagcgg aaaaagacac atggatcggg acggtgcctg taggctatgc 480 
ggacggctgg ctccgaaaat tgaaagggac cgacatcctt gtgaagggaa aacgcctgaa 540 
aattgccggc cgaatttgca tggaccaatt tatggtggag ctggatcagg aatatccgcc 600 
gggcacaaaa gtcacattaa taggccggca gggggatgaa tatatttcca tggatgagat 660 
tgcaggaagg ctcgaaacca ttaactatga ggtggcctgt acaataagtt cccgtgttcc 720 
ccgtatgttt ttggaaaatg ggagtataat ggaagtaaga aatcctttat tgcaggtaaa 780 
tataagcaat taacttacct aaatggagaa ttcataaaac agctttgcgt cgacgatgaa 840 
gatggatttt ctattattgc aatgtggaat tgggaacgga aaaattattt tattaaagag 900 
tagttcaaca aacgggccag tttgttgaag attagatgct ataattgtta ttaaaaggat 960 
tgaaggatgc ttaggaagac gagttattaa tagctgaata agaacggtgc tctccaaata 1020 
ttcttattta gaaaagcaaa tctaaaatta tctgaaaagg gaatgagaat agtgaatgga 1080 
ccaataataa tgactagaga agaaagaatg aagattgttc atgaaattaa ggaacgaata 1140 
ttggataaat atggggatga tgttaaggct attggtgttt atggctctct tggtcgtcag 1200 
actgatgggc cctattcgga tattgagatg atgtgtgtca tgtcaacaga ggaagcagag 1260 
ttcagccatg aatggacaac cggtgagtgg aaggtggaag tgaattttga tagcgaagag 1320 
attctactag attatgcatc tcaggtggaa tcagattggc cgcttacaca tggtcaattt 1380 
ttctctattt tgccgattta tgattcaggt ggatacttag agaaagtgta tcaaactgct 1440 
aaatcggtag aagcccaaac gttccacgat gcgatttgtg cccttatcgt agaagagctg 1500 
tttgaatatg caggcaaatg gcgtaatatt cgtgtgcaag gaccgacaac atttctacca 1560 
tccttgactg tacaggtagc aatggcaggt gccatgttga ttggtctgca tcatcgcatc 1620 
tgttatacga cgagcgcttc ggtcttaact gaagcagtta agcaatcaga tcttccttca 1680 
ggttatgacc atctgtgcca gttcgtaatg tctggtcaac tttccgactc tgagaaactt 1740 
ctggaatcgc tagagaattt ctggaatggg attcaggagt ggacagaacg acacggatat 1800 
atagtggatg tgtcaaaacg cataccattt tgaacgatga cctctaataa ttgttaatca 1860 
tgttggttac gtatttatta acttctccta gtattagtaa ttatcatggc tgtcatggcg 1920 
cattaacgga ataaagggtg tgcttaaatc gggccatttt cgctaataag aaaaaggatt 1980 
aattatgagc gaattgaatt aataataagg taatagattt acattagaaa atgaaagggg 2040 
attttgcggc cgccaacctc gagatctctt agatttttgg ggttatttag gggagaaaac 2100 
ataggggggt actacgacct cccccctagg tgtccattgt ccattgtcca aacaaataaa 2160 
taaatattgg gtttttaatg ttaaaaggtt gttttttatg ttaaagtgaa aaaaacagat 2220 
gttgggaggt acagtgatgg ttgtagatag aaaagaagag aaaaaagttg ctgttacttt 2280 
aagacttaca acagaagaaa atgagatatt aaataggaat tcgagctcat tattaatctg 2340 
ttcagcaatc gggcgcgatt gctgaataaa agatacgaga gacctctctt gtatcttttt 2400 
tattttgagt ggttttgtcc gttacactag aaaaccgaaa gacaataaaa attttattct 2460 
tgctgagtct ggctttcggt aagctagaca aaacggacaa aataaaaatt ggcaagggtt 2520 
taaaggtgga gattttttga gtgatcttct caaaaaatac tacctgtccc ttgctgattt 2580 
ttaaacgagc acgagagcaa aacccccctt tgctgaggtg gcagagggca ggtttttttg 2640 
tttctttttt ctcgtaaaaa aaagaaaggt cttaaaggtt ttatggtttt ggtcggcact 2700 
gccgacagcc tcgcagagca cacactttat gaatataaag tatagtgtgt tatactttac 2760 
ttggaagtgg ttgccggaaa gagcgaaaat gcctcacatt tgtgccacct aaaaaggagc 2820 
gatttacata tgagttatgc agtttgtaga atgcaaaaag tgaaatcagc tggactaaaa 2880 
ggcagagctc ggtacccggg agctctatca attggtaact gtatctcagc ttgaagaagt 2940 
gaagaagcag agaggctatt gaataaatga gtagaagcgc catatcggcg cttttctttt 3000 



wo 01/90393 



PCT/DKOl/00356 



10028 

ggaagaaaat atagggaaaa tggtacttgt taaaaattcg gaatatttat acaatatcat 3060 
atgttacaca ttgaaagggg aggagaatca tgaaacaaca aaaacggctt tacgcccgat 3120 
tgctgacgct gttatttgcg ctcatcttct tgctgcctca ttctgcagcc gcggcacacc 3180 
ataatggtac gaacggcaca atgatgcagt actttgaatg gtatctacca aatgacggaa 3240 
accattggaa tagattaagg tctgatgcaa gtaacctaaa agataaaggg atctcagcgg 3300 
tttggattcc tcctgcatgg aagggtgcct ctcaaaatga tgtggggtat ggtgcttatg 3360 
atctgtatga tttaggagaa ttcaatcaaa aaggaaccat tcgtacaaaa tatggaacgc 3420 
gcaatcagtt acaagctgcg gttaacgcct tgaaaagtaa tggaattcaa gtgtatggcg 3480 
atgttgtaat gaatcataaa gggggagcag acgctaccga aatggttagg gcagttgaag 3540 
taaacccgaa taatagaaat caagaagtgt ccggtgaata tacaattgag gcttggacaa 3600 
agtttgactt tccaggacga ggtaatactc attcaaactt caaatggaga tggtatcact 3660 
ttgatggagt agattgggat cagtcacgta agctgaacaa tcgaatttat aaatttagag 3720 
gtgatggaaa agggtgggat tgggaagtcg atacagaaaa cggtaactat gattacctaa 3780 
tgtatgcaga tattgacatg gatcacccag aggtagtgaa tgagctaaga aattggggtg 3840 
tttggtatac gaatacatta ggccttgatg gttttagaat agatgcagta aaacatataa 3900 
aatacagctt tactcgtgat tggattaatc atgttagaag tgcaactggc aaaaatatgt 3960 
ttgcggttgc ggaattttggr aaaaatgatt taggtgctat tgaaaactat ttaaacaaaa 4020 
caaactggaa ccattcagtc tttgatgttc cgctgcacta taacctctat aatgcttcaa 4080 
aaagcggagg gaattatgat atgaggcaaa tatttaatgg tacagtcgtg caaagacatc 4140 
caatgcatgc tgttacattt gttgataatc atgattcgca acctgaagaa gctttagagt 4200 
cttttgttga agaatggttc aaaccattag cgtatgcttt gacattaaca cgtgaacaag 4260 
gctacccttc tgtattttat ggagattatt atggcattcc aacgcatggt gtaccagcga 4320 
tgaaatcgaa aattgacccg attctagaag cgcgtcaaaa gtatgcatat ggaagacaaa 4380 
atgactactt agaccatcat aatatcatcg gttggacacg tgaagggaat acagcacacc 4440 
ccaactccgg tttagctact atcatgtccg atggggcagg aggaaataag tggatgtttg 4500 
ttgggcgtaa taaagctggt caagtttgga ccgatatcac tggaaatcgt gcaggtactg 4560 
ttacgattaa tgctgatgga tggggtaatt tttctgtaaa tggaggatca gtttctattt 4620 
gggtaaacaa ataagtcgac ggcccagccg gccgagctcg gatagaagag cagagaagac 4680 
ggatttcctg aaggaaatcc gtttttttat tttgcccgtc ttataaattt ctttgattac 4740 
attttataat taattttaac aaagtgtcat aagcccgatg gaatattgct gaagcttatc 4800 
gataacaggt cattttttag gagggtttac atcatggcaa tacttgttac tggcggtgcc 4860 
ggttacattg gcagccacac atgtgttgaa ctattgaaca gcggctacga gattgttgtt 4920 
cttgataatc tgtccaacag ttcagctgaa gcgctgaacc gtgtcaagga gattacagga 4980 
aaagatttaa cgttctacga agcggattta ttggaccggg aagcggtaga ttccgttttt 5040 
gctgaaaatg aaatcgaagc tgtgattcat tttgcagggt taaaagcagt cggcgaatct 5100 
gtggcgattc ccctcaaata ttatcataac aatttgacag gaacgtttat tttatgcgag 5160 
gccatggaga aatacggcgt caagaaaatc gtattcagtt catctgcgac agtatacggc 5220 
gttccggaaa catcgccgat tacggaagac tttccattag gcgcgacaaa tccttatggg 5280 
cagacgaagc tcatgcttga acaaatattg cgtgatttgc atacagccga caatgagtgg 5340 
agcgttgcgc tgcttcgtta ctttaacccg ttcggcgcgc atccaagcgg acggatcggt 5400 
gaagacccga acggaatccc aaataacctt atgccgtatg tggcacaggt agcagtcggg 5460 
aagctcgagc aattaagcgt attcggaaat gactatccga caaaagacgg gacaggcgta 5520 
cgcgattata ttcacgtcgt tgatctcgca gaaggccacg tcaaggcgct ggaaaaagta 5580 
ttgaactcta caggagccga tgcatacaac cttggaacag gcacaggcta cagcgtgctg 5640 
gaaatggtca aagcctttga aaaagtgtca gggaaagagg ttccataccg ttttgcggac 5700 
cgccgtccgg gagacatcgc cacatgcttt gcagatcctg cgaaagccaa gcgagaacta 5760 
ggctgggaag cgaaacgcgg ccttgaggaa atgtgtgctg attcctggag atggcagtct 5820 
tctaatgtga atgggtataa gagtgcggaa taagaatgga ggccttctca attgagaagg 5880 
ccttttttaa agaacaaggg tgcctaaaca ggcacccttg ttagctgtta tttgattttc 5940 
acg 5943 



<210> 3 
<211> 5793 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pMOL1809 
<400> 3 

gatccatctg aaggtcgata cggggatgaa cagacttggt gtaaaaacag aggaagaagt 60 
tcagaacgtg atggcaattc ttgaccgcaa ccctcgttta aagtgcaaag gggtatttac 120 
ccattttgcg acagcggatg aaaaagaaag aggctatttc ttaatgcagt ttgagcgctt 180 



wo 01790393 PCT/DKOl/00356 



10028 

taaagagctg attgctccgc tgccgttaaa gaatctaatg gtccactgcg cgaacagcgc 240 
cgctggactc cggctgaaaa aaggcttttt taatgcagtc agattcggca tcggcatgta 300 
tggccttcgc ccgtctgctg acatgtcgga cgagataccg tttcagctgc gtccggcatt 360 
taccctgcat tcgacactgt cacatgtcaa actgatcaga aaaggcgaga gcgtcagcta 420 
cggagccgag tacacagcgg aeiaaagacac atggatcggg acggtgcctg taggctatgc 480 
ggacggctgg ctccgaaaat tgaaagggac cgacatcctt gtgaagggaa aacgcctgaa 540 
aattgccggc cgaatttgca tggaccaatt tatggtggag ctggatcagg aatatccgcc 600 
gggcacaaaa gtcacattaa taggccggca gggggatgaa tatatttcca tggatgagat 660 
tgcaggaagg ctcgaaacca ttaactatga ggtggcctgt acaataagtt cccgtgttcc 720 
ccgtatgttt ttggaaaatg ggagtataat ggaagtaaga aatcctttat tgcaggtaaa 780 
tataagcaat taacttaccb aaatggagaa ttcataaaac agctttgcgt cgacgatgaa 840 
gatggatttt ctattattgc aatgtggaat tgggaacgga aaaattattt tattaaagag 900 
tagttcaaca aaogggccag tttgttgaag attagatgct ataattgtta ttaaaaggat 960 
tgaaggatgc ttaggaagac gagttattaa tagctgaata agaacggtgc tctccaaata 1020 
ttcttattta gaaaagcaaa tctaaaatta tctgaaaagg gaatgagaat agtgaatgga 1080 
ccaataataa tgactagaga agaaiagaatg aagattgttc atgaaattaa ggaacgaata 1140 
ttggataaat atggggatga tgttaaggct attggtgttt atggctctct tggtcgtcag 1200 
actgatgggc cctattcgga tattgagatg atgtgtgtca tgtcaacaga ggaagcagag 1260 
ttcagccatg aatggacsiac cggtgagtgg aaggtggaag tgaattttga tagcgaagag 1320 
attctactag attatgcatc tcaggtggaa tcagattggc cgcttacaca tggtcaattt 1380 
ttctctattt tgccgattta tgattcaggt ggatacttag agaaagtgta tcaaactgct 1440 
aaatcggtag aagcccaaac gttccacgat gcgatttgtg cccttatcgt agaagagctg 1500 
tttgaatatg caggcaaatg gcgtaatatt cgtgtgcaag gaccgacaac atttctacca 1560 
tccttgactg tacaggtagc aatggcaggt gccatgttga ttggtctgca tcatcgcatc 1620 
tgttatacga cgagcgcttc ggtcttaact gaagcagtta agcaatcaga tcttccttca 1680 
ggttatgacc atctgtgcca gttcgtaatg tctggtcaac tttccgactc tgagaaactt 1740 
ctggaatcgc tagagaattt ctggaatggg attcaggagt ggacagaacg acacggatat 1800 
atagtggatg tgtcaaaacg cataccattt tgaacgatga cctctaataa ttgttaatca 1860 
tgttggttac gtatttatta acttctccta gtattagtaa ttatcatggc tgtcatggcg 1920 
cattaacgga ataaagggtg tgcttaaatc gggccatttt cgctaataag aaaaaggatt 1980 
aattatgagc gaattgaatt aataataagg taatagattt acattagaaa atgaaagggg 2040 
attttgcggc cgccaacctc gagatctctt agatttttgg ggttatttag gggagaaaac 2100 
ataggggggt actacgacct cccccctagg tgtccattgt ccattgtcca aacaaataaa 2160 
taaatattgg gtttttaatg ttaaaaggtt gttttttatg ttaaagtgaa aaaaacagat 2220 
gttgggaggt acagtgatgg ttgtagatag aaaagaagag aaaaaagttg ctgttacttt 2280 
aagacttaca acagaagaaa atgagatatt aaataggaat tcgagctcat tattaatctg 2340 
ttcagcaatc gggcgcgatt gctgaataaa agatacgaga gacctctctt gtatcttttt 2400 
tattttgagt ggttttgtcc gttacactag aaaaccgaaa gacaataaaa attttattct 2460 
tgctgagtct ggctttcggt aagctagaca aaacggacaa aataaaaatt ggcaagggtt 2520 
taaaggtgga gattttttga gtgatcttct caaaaaatac tacctgtccc ttgctgattt 2580 
ttaaacgagc acgagagcaa aacccccctt tgctgaggtg gcagagggca ggtttttttg 2640 
tttctttttt ctcgtaaaaa aaagaaaggt cttaaaggtt ttatggtttt ggtcggcact 2700 
gccgacagcc tcgcagagca cacactttat gaatataaag tatagtgtgt tatactttac 2760 
ttggaagtgg ttgccggaaa gagcgaaaat gcctcacatt tgtgccacct aaaaaggagc 2820 
gatttacata tgagttatgc agtttgtaga atgcaaaaag tgaaatcagc tggactaaaa 2880 
ggcagagctc ggtacccggg agctctatca attggtaact gtatctcagc ttgaagaagt 2940 
gaagaagcag agaggctatt gaataaatga gtagaagcgc catatcggcg cttttctttt 3000 
ggaagaaaat atagggaaaa tggtacttgt taaaaattcg gaatatttat acaatatcat 3060 
atgttacaca ttgaaagggg aggagaatca tgaaacaaca aaaacggctt tacgcccgat 3120 
tgctgacgct gttatttgcg ctcatcttct tgctgcctca ttctgcagcc gcggcacacc 3180 
ataatggtac gaacggcaca atgatgcagt actttgaatg" gtatctacca aatgacggaa 3240 
accattggaa tagattaagg tctgatgcaa gtaacctaaa agataaaggg atctcagcgg 3300 
tttggattcc tcctgcatgg aagggtgcct ctcaaaatga tgtggggtat ggtgcttatg 3360 
atctgtatga tttaggagaa ttcaatcaaa aaggaaccat tcgtacaaaa tatggaacgc 3420 
gcaatcagtt acaagctgcg gttaacgcct tgaaaagtaa tggaattcaa gtgtatggcg 3480 
atgttgtaat gaatcataaa gggggagcag acgctaccga aatggttagg gcagttgaag 3540 
taaacccgaa taatagaaat caagaagtgt ccggtgaata tacaattgag gcttggacaa 3600 
agtttgactt tccaggacga ggtaatactc attcaaactt caaatggaga tggtatcact 3660 
ttgatggagt agattgggat cagtcacgta agctgaacaa tcgaatttat aaatttagag 3720 
gtgatggaaa agggtgggat tgggaagtcg atacagaaaa cggtaactat gattacctaa 3780 
tgtatgcaga tattgacatg gatcacccag aggtagtgaa tgagctaaga aattggggtg 3840 
tttggtatac gaatacatta ggccttgatg gttttagaat agatgcagta aaacatataa 3900 
aatacagctt tactcgtgat tggattaatc atgttagaag tgcaactggc aaaaatatgt 3960 
ttgcggttgc ggaattttgg aaaaatgatt taggtgctat tgaaaactat ttaaacaaaa 4020 
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caaactggaa ccattcagtc tttgatgttc cgctgcac^ ScStcJg caalgacatc 4140 
aSgcgSgg gaattatgat atgaggcaaa ^atttaatgg tac^g^^gcg ^^^^^^^^g, ^oo 
caatgcatgc tgttacattt g^tg^^aatc |^9ar g ^^^^^^^^^ cgtgaacaag 4260 
cttttgtt^ga agaatggttc aaaccattag ^9^^ | aacgcatggt gtaccagcga 4320 
gctaclcttc tgtattttat fg^gattatt atggcattcc J 9 ggaagacaaa 4380 

tgaaatcgaa aattgacccg attctagaag °9°9"»»* tgaagggaat acagcacacc 4440 
atgactaltt agaccatcat aatatcatcg |"9gaoacg ^ga ggg tgtttg 4500 

cclactccgg tttagctact atoatgtccg atgggcagg g ^^^^^ gcaggtactg 4560 
ttgggcgtaa taaagctggt caagtttgga cc9atatcac 3| gtttctattt 4620 

ttiliattaa tgctgatgga tggggtaatt "tctgtaaa ^ gagggtttac 4680 
gggtaaacaa ataagtcgac ggc«agccg 9=°^acagg^ gcagccacac atgtgttgaa 4740 
Itcatggcaa tacttgttac tggcggtgcc 9g^acatcg 9 | ttcagctgaa 4800 

ctlttilaca gcggctacga gattgttgtt ^^g^^aatc tgj^ | ^g^ggattta 4860 
gcgctgaacc gtgtcaagga gftacagga aaagatttaa 9 tgtgattcat 4920 

ttggaccggg aagcggtaga ttccgttttt S^tgaaaatg ^ ttatcataac 4980 

tt?gcagggt taaaagcagt cggcgaatct gtggcgttc -^^^^g^^^ ,,agaaaatc 5040 
aatttgacag gaacgtttat tttatgcgag 9 ^ a ^ catcgccgat tacggaagac 5100 
gtattcagtt catctgcgac agtatacggc gttccggaaa | acaaatattg 5160 

Ittccattag gcgcgacaaa tccttatggg ^^9^9^^ tgctlcgtta ctttaacccg 5220 
cgtgatttgc atacagccga caatgagtgg agcgttgcgc 9 ^ aaataacctt 5280 
ttcggcgcgc atccaagcgg acggatcggt gaagacccga acgg attcggaaat 5340 

atgccgtalg tggcacaggt agcagtoggg aagctcgagc aattaag g ^ ^^^^^^g^^ 5400 
galtalccga caaaagacgg gacaggcgta ^gcgattata ^tc g 3^ ticatacaac 5460 
laaggccacg tcaaggcgct ggaaaaagta ttgaactcta ^^9 9 aaaagtgtca 5520 
cttlgaacag gcacaggcta cagcgtgctg gaaatggtca aag a cacatgcttt 5580 
gggaaagagg ttccataccg ttttgcggac cg^^gt^^gg 9^11^^ ccttgaggaa 5640 
Slgatccg cgaaagccaa gcgagaacta ggcgggj^l t?2gStaI gagt|cggaa 5700 
ralSSgS ™S SfgiSS cSSStIa agfacaaggg tgcctaaaca 5760 
ggcacccttg ttagctgtta tttgattttc acg 



<210> 4 
<211> 29 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: 
B5860H10 



<400> 4 29 
ttacatccgc gggtgaggaa agacaggac 



<210> 5 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

:22?> Description of Artificial Sequence: Primer 
B5860H11 

<400> 5 ^^4-^^ ^'^ 

tagtgaattc agaaccggtc cacatcc 

<210> 6 

<211> 30 

<212> DNA \ 

<213> Artificial Sequence 

<220> 
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<223> Description of Artificial Sequence: Primer 181804 
<400> 6 

tgttcccgag aatggaggcc ttctcaattg 



<210> 7 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 181805 
<400> 7 

tggttgtcga catctgaggg aggtacaatt gtagctg 



<210> 8 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 188502 
<400> 8 

ttttcatcga tactagtgtg cacggatcca tctgaaggtc gatacggg 



<210> 9 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 188836 
<400> 9 

ttgtttgtcg acgcaaagct gttttatgaa ttctcc 



<210> 10 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 190694 
<400> 10 

ttttggccca gccggccaac aggtcatttt ttaggaggg 



<210> 11 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenc : Primer 190695 
<400> 11 

ttattggatc cgtgaaaatc aaataacagc taacaaggg 

7 
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<210> 12 \ 
<211> 33 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 190697 
<400> 12 

ttttcatcga taacaggtca ttttttagga ggg 33 



8 



